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Influence of Bulk and Surface Properties on 
Image Sensing Silicon Diode Arrays 


By T. M. BUCK, H. C. CASEY, Jr., J. V. DALTON, and M. YAMIN 


(Manuscript received March 7, 1968) 


Silicon diode arrays for use as the electron-beam accessed target in camera 
tubes for the Picturephone® visual telephone set have been fabricated and 
their properties evaluated. These targets offer significant advantages over the 
antimony trisulfide target commonly used in vidicon-type tubes. But there 
are certain potential limitations which must be dealt with in developing 
a silicon target. Three of its critical requirements are adequate sensitivily 
to visible light, low dark current, and junction untfornuty and freedom 
from defects across at least 300,000 diodes per square centimeter. Sensitivity 
to visible light 1s expressed here by the efficiency for conversion of incident 
photons to electrons in the read-out circuit. Conversion efficiencies exceeding 
50 percent in the visible region have been achieved by oxidizing or by diffusing 
phosphorus into the light-receiving surface to reduce the surface-recombina- 
tion velocity. Diode leakage currents of <1 X 10°” A per diode are re- 
quired, and are obtained for target voltages up to about 5 to 7 V. Surface 
generated current dominates in the 8-u diameter diodes of the array, but 
this convponent of current can be reduced substantially by use of (100) 
surfaces or by hydrogen annealing. Visible defects in a picture can result 
from leaky diodes or oxide pinholes which cause bright spots, and diodes 
covered by oxide which cause dark spots. Our best targets show «a video 
display with only a few defects; processing must be improved to eliminate 
defects completely. 
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I, INTRODUCTION 


A television camera tube with a silicon diode array target has been 
reported recently by Crowell, Gordon and their co-workers.1 A target 
of the general type used in this tube was first proposed in 1951 by 
Reynolds,* later discussed by Heijne,® and recently analyzed by Wend- 
land. It is similar, but not identical in operation, to the evaporated- 
film photoconductive target, typically antimony trisulfide, which is 
commonly used in vidicon TV camera tubes.’ 

A vidicon-type tube is of interest for use in the Picturephone® 
visual telephone station set because it is the least expensive and 
smallest camera tube that has the required sensitivity and resolution. 
The silicon target has certain potential advantages over the evap- 
orated-film photoconductive target. 

This paper describes the effect of bulk and surface properties on 
the performance of the silicon target. The properties that dominate 
the conversion of incident photons to electrons in the external circuit 
and the diode leakage current are analyzed in detail. These analyses, 
together with the relevant processing techniques and resulting be- 
havior, have been combined into a description of silicon diode arrays 
for image sensing. 

The operation of the silicon target is illustrated in Fig. 1. A scan- 
ning electron beam charges the diode-array side of the silicon target 
down to cathode (ground) potential while the n-region is held a few 
volts above ground. This puts reverse bias on the diodes. Light shin- 
ing on the other side of the target and absorbed in the n-region gen- 
erates holes, some of which diffuse to the diodes and reduce the nega- 
tive charge on the p-regions. This reduction establishes a stored 
charge pattern. The scanning electron beam returning to the site of 
the diode deposits more negative charge, an amount proportional to 
the light intensity. The recharging current constitutes the video signal. 
Leakage of the charge pattern established by the light is prevented by 
the rectifying p-n junction rather than by high bulk resistivity as in 
the case of antimony trisulfide. The usual time between scans of the 
electron beam at a given diode site is 1/30 second. 

The silicon target has several advantages over evaporated-film 
photoconductive targets such as antimony trisulfide: 


(z) It does not show aging effects (burn-in) by intense light to which 
it might be exposed accidentally, or by the electron beam. The absence 
of burn-in by the electron beam permits electronic zooming. 

(iz) Its photoconductive lag is negligible. 
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Fig. 1— Schematic diagrams of silicon target, illustrating principle of operation. 


(iz) It does not deteriorate on heating to temperatures sufficiently 
high (4350°C) for good tube processing required for long life. 


There are, however, certain potential problems and limitations 
which must be considered in developing a successful silicon diode 
array target for a camera tube. Three of these, related to materials 
and process factors, are: 


(t) The target must have adequate sensitivity to visible light. We 
express sensitivity as conversion efficiency 7, which is defined as the 
ratio of electrons that flow in the external circuit to the number of 
incident photons. For photons in the 0.45 to 0.8-1 wavelength range, 
a value of 7, > 20 percent would be satisfactory. 

(ii) The total dark current should be less than 50 X 10°° A, which 
means diode leakage current for each of the approximately 1/2 million 
diodes must be S 1 X 10°” A, so that only a negligible amount of 
charge will leak off between scans of the electron beam. For camera 
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tube applications total diode capacitance is restricted to a rather narrow 
range. Preliminary results indicate that a substrate resistivity of 10 
ohm-cm, which yields 2000 pI'/cm’ at 10 V reverse bias, is close to 
the optimum.” 

(iz) The whole array must have uniform properties and be free of 
defects which can cause bright spots or dark spots in the display-tube 
picture. 


Tube performance is, of course, the ultimate test of a good target, 
but for studies of efficiency and diode leakage it was convenient to 
make measurements outside the tube. We demonstrate the relationship 
between these measurements and the actual tube performance. 


II. TARGET STRUCTURE 


The target, as illustrated in Fig. 2, is a thin disk of n-type silicon 
with an array of p-n diodes on one side. These are the sensing ele- 
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Fig 2— Details of target structure (660 x 660 array). 
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ments. Silicon dioxide covers the n-type silicon between the diodes to 
keep the electron beam from landing there. The oxide also protects 
the junction edges and reduces surface leakage. In a typical design 
the target is 0.850 inch in diameter, 0.5 to 1 mil thick in the light 
sensing region, and has a 4-mil-thick rim for support. The diodes are 
8 » in diameter and are on 20 » centers in an array of 660 x 660. 

Outside the main array are 5 mil p-n diodes which can be probed 
for measurements at various stages of processing. There are also gold 
dots over the oxide for MOS measurements and gold dots over test 
arrays of 8 » diodes for reverse current measurements which simulate 
dark current measurements in the tube. Ohmic contact to the n-region 
is made through an annular n+ region which is in the thick ring on 
the light-receiving side of the target. The dimensions just given are 
representative of many targets which have been made, although larger 
and denser arrays are also being studied. 

Planar technology is used in the fabrication with the following 
deviations from typical device processing: 


(z) One-step diffusions are used, with no drive-in. 

(wz) There is no postdiffusion reoxidation and therefore no second 
photoresist step or reregistration. . 

(wz) The phosphorus diffusion for ohmic contact is the last high 
temperature step, for reasons discussed near the end of Section 3.3. 


An additional processing step, the deposition of a semi-insulating 
film over the diode side, is usually performed before the target is 
mounted in a tube. The purpose of this film is to dissipate charge, 
deposited by the electron beam, from the target area between the 
diodes. Several films have been developed for this purpose by Crowell 
and Labuda.? Targets described in this paper did not have such films 
except in cases where tube measurements are mentioned. 


III. CONVERSION EFFICIENCY 


3.1 Calculation of Conversion Efficiency 


If a silicon diode array target is to replace the antimony trisulfide 
target in a vidicon-type camera tube, its sensitivity should approxi- 
mate or exceed that of antimony trisulfide targets. We describe sensi- 
tivity in terms of the conversion efficiency 7. which is defined as the 
ratio of the number of electrons that flow in the external circuit to 
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the number of incident photons. Efficiency of antimony trisulfide tar- 
gets is typically 20 percent at 5500 A and falls off toward both ends 
of the visible region. For silicon targets the conversion efficiency will 
depend on the sample thickness, the wavelength of the light and 
several properties of the semiconductor. In order to identify the im- 
portant parameters and their effects on the conversion efficiency, the 
steady-state, short-circuit current in a one-dimensional model of a 
single p-n junction has been calculated. 

The most important relation governing distribution of the optically 
generated carriers in the semiconductor is the continuity equation. 
The continuity equation formulation for the transport of carriers has 
been rigorously treated by van Roosbroeck.® From that treatment the 
steady-state, small-signal differential equation for the minority car- 
rier density in excess of the equilibrium concentration at zero total 
current and electric field in the one dimension z is 


2 
p, SBS — 22) _ _ eq), ) 
where p(x) is the excess minority carrier density, D, the hole diffu- 
sion coefficient, 7, the hole lifetime, and G(x) the net carrier genera- 
tion rate. 

At one boundary, the surface at + = 0, the hole flux as determined 
by the surface-recombination velocity S must equal the diffusive flux: 


iO) = aSp0) = aD, FE ) 


The other boundary condition refers to the edge of the junction 
space-charge region located at depth x = d from the illuminated sur- 
face. For the short-circuit condition, the excess hole density at the 
junction edge is zero, 


and the short-circuit current density 1¢¢ 1s 
: d 
i= gD, Fl. (4) 


When the generation rate for carriers is governed by Lambert’s law 
of photon absorption, the net generation rate may be written as 


_Q—R)N 
- A 


G(x) a exp (—az), (5) 
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where FR is the reflectivity, N the number of incident photons per 
unit time, A the cross-sectional area, and «a the optical absorption 
coefficient. The conversion efficiency, neglecting absorption in the 
junction space-charge region, becomes 





_p 
Lac ar r=d 
=> a = OC" 6 
Te N/A N/A (6) 


The conversion efficiency may be obtained from equation (1) with 
the conditions expressed by equations (2) through (5) and the defini- 
tion of equation (6). Since the generation rate G(x) has a dependence 
on wavelength through both the reflectivity and absorption coef- 
ficient, », will depend on wavelength. As Section 3.3 describes in detail, 
difficulty has been encountered in reconciling the experimental and 
calculated y, for illuminated surfaces that have been etched and aged 
in air. The experimental y, for the etched surface is always signifi- 
cantly less than the calculated »,, even with very high S, when a sig- 
nificant portion of the carriers are generated near the illuminated 
surface. 

This situation is similar to Wittry and Kyser’s® ° experiences with 
cathodoluminescence in GaAs. Their cathodoluminescence intensity 
was less than could be explained by a high surface recombination 
alone. They assumed that minority carriers generated between the 
surface and a depth 8 are not effective in producing recombination 
radiation. Similarly, in the present work it was found necessary to 
modify the generation rate given by equation (5) to account for a 
“dead layer” at the surface in order to obtain agreement between the 
calculated and experimental ¢. 

The effect of a “dead layer” on the conversion efficiency has been 
introduced into the analysis by assuming that the carriers generated 
within a distance § of the illuminated surface cannot diffuse to the 
junction and be collected. For a solid in which all the incident pho- 
tons, less those lost by reflection, are absorbed in creating hole- 
electron pairs, the number of carriers per unit area generated be- 
tween the surface and x = 8 is found by integrating G(x) dz between 
the limits 0 and 8. Thus, there will be (1—R)N[1 — exp(—a8)] car- 
riers generated within a distance 8 of the surface. The number of 
holes that may be collected in the absence of a “dead layer” or sur- 
face and bulk recombination is equal to the number of absorbed pho- 
tons and is simply (1—F)N. If the carriers generated between x = 0 
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and 8 are lost and not available for collection, then the carriers 
available for collection in the absence of other losses is simply the 
difference of these two quantities, (1—R)N exp(—a8). Therefore, a 
generation rate of 





Ge) = E ats a exp (—aa) | exp (—aéd), (7) 


when integrated from 0 to 0, will give the same number of collectable 
carriers as [(1—R)N/A] « exp(—av) integrated from § to co: that is, 


/ Gyan | CG de. 


This formalism, which has been found useful in representing the effect 
of the surface space-charge region on optically generated carriers, 
permits retaining the field-free continuity equation and the represen- 
tation of surface recombination by S. The conversion efficiency for 
the generation rate given by equation (7) is 


_ exp (—a8)(1 — Ral, [¢ + oL?/r,) sech tL) | 
> ee (S tanh (d/L,) + L,,/7,) 


, Ssech? (d/L,) 
— at, + tanh (d/Z,) + ae ee | exp (—ad)}, 
(8) 


where the minority carrier diffusion length LZ, is given by L, = 
(Dyr,)*. The efficiency y- will be expressed in percent. To further 
describe the ‘dead layer” and delineate the role of bulk and surface 
recombination on the target sensitivity, it is necessary to compare the 
experimental y, variation as a function of wavelength with y. calcu- 
lated from equation (8). 


3.2 Lxperimental Procedure for Efficiency Measurements 


To evaluate the properties of the silicon diode arrays, it was con- 
venient to make measurements outside the tube. Targets for this 
purpose had large diodes and had received the same processing as 
the 8-» diode arrays except that the semi-insulating film was omitted. 
Figure 3 illustrates the structure used. The pattern with diodes from 
5 to 40 mils in diameter was used so that the effect of diode diameter 
on 7 for a given light-spot size could be determined. Because equa- 
tion (1) applies only to a one-dimensional problem, one-dimensional 
experimental conditions must be achieved. These conditions were 
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Fig. 3— Array of large diodes for studies of conversion efficiency. and diode 
leakage. 


taken to apply whenever further increase in diode diameter did not 
increase the conversion efficiency, which occurred for diameters 
greater than 30 mils. The y, measurements reported here were taken 
with the 40-mil diameter diodes. 

The experimental arrangement for measuring the conversion efficiency 
is illustrated in Fig. 4. Emission from the tungsten 250 W quartz-iodine 
lamp was filtered by a Corning filter C. S. 1-69 (heat-absorbing glass) 
to reduce the infrared intensity. Plane mirror M, and spherical mirror 
M, focused the light onto the entrance slit of the Perkin-Elmer model 
99 single-prism spectrometer. The light was chopped at the spectrometer 
entrance at 37.5 Hz. An Optics Technology band-pass filter was used 
at each measurement wavelength to prevent light of undesired wave- 
lengths from being transmitted through the spectrometer. The light 
from the spectrometer exit slit could be directed to the sample or to a 
calibrated thermocouple by the movable mirror. The spherical mirror 
M, focused the radiation onto the thermocouple whose output was 
measured with the Princeton Applied Research model HR-8 lock-in 
amplifier. 

The spot size of approximately 4 X 12 mils on the sample was obtained 
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Fig. 4 Experimental arrangement for measurements of conversion efficiency. 


by reducing the height of the spectrometer slits and using M,, a 90° 
ellipsoidal mirror with a 6 : 1 reduction. The short-circuit current was 
determined by measuring the voltage across a 100 KQ resistor connected 
between the n and p sides of the diode. To maintain junction short- 
circuit conditions, the light intensity was kept low enough so that 
the junction voltage was S 0.1 kT/q(S 0.002 V). Linearity with 
light intensity was confirmed with neutral density filters and absence 
of significant leakage currents could be demonstrated by linear varia- 
tion of junction voltage with load resistance. 

In order to make quantitative comparison of the experimental and 
calculated n, the absolute photon flux must be known. For this purpose, 
a silicon solar cell was calibrated by comparison with several calibrated 
thermopiles. Then the calibrated solar cell was placed in the position 
of the sample and the spectrometer thermocouple was calibrated. The 
absolute photon flux can be assigned an uncertainty of +10 percent. 


3.3 Experimental and Calculated Conversion Efficiency 

In this section the control of target efficiency by process variations is 
discussed. Experimental efficiency data are compared with calculated 
curves, and this permits determination of the parameters L,, S, and 
8. Consideration of equation (8) shows that the best discrimina- 
tion between L,, S, and § is obtained when d is two to four times Ly. 
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Preliminary measurements indicated that a 4-mil target thickness was 
suitable. The wavelength dependence of the absorption coefficient « 
was taken from the data of Dash and Newman,™ and the reflectivity 
F for an etched silicon surface was taken from Philipp and Taft’s 
data.1” The hole diffusion coefficient D, was assigned a value of 10 
cm? per second. Given these experimental dependences of « and R, 
the wavelength becomes the independent variable for equation (8), 
and the variation of », with wavelength is determined by the physical 
parameters d, L,, S, and 8. 

Initially, consider the data in Fig. 5 for the etched surface aged in 
air (the dots). The calculated y, curve for a generation rate expressed 
by equation (5) (no “dead layer’’) is shown by the dashed line. The 
diffusion length Z, has been assigned a value of 50 u to produce agree- 
ment between the calculated and experimental y, in the long wave- 
length region. The use of the generation rate of equation (5) is equiv- 
alent to 6 = O in equation (8) for n,. Even with the maximum value 
of surface-recombination velocity Smax = (kT'/2rm)' = 10’ em per 
second, the calculated y, does not decrease as rapidly at short wave- 
lengths as the experimental 7, . The discrepancy between the calculated 
and experimental y, for wavelengths in the visible region leads to 
the “dead layer” concept. 

Fitting the data in Fig. 5 with a finite 8 in equation (8) does not 
lead to unique values of S and 8. However, their values are limited to 
a reasonably narrow range. For example, by plotting the experimental 
data with the calculated y, (lower curve, Fig. 5) it is not possible to 
discriminate between values of S = 10° cm per second, § = 0.8 p», and 
S = 6 X 10‘ cm per second, § = 1.8 ». The larger 8 value was obtained 
with a nonlinear least-squares technique described by Marquardt.%° 
The high surface-recombination velocity agrees with earlier studies 
of Buck and McKim** and Harten?® in which it was shown that S 
is normally very high on an etched silicon surface. Harten’s'® meas- 
urement technique was similar to the one described here. To resolve 
this ambiguity in S and 8, experimental data at wavelengths less than 
0.45 » are necessary. Because the efficiency of the diode is rapidly 
decreasing and the intensity of the light source is also becoming 
smaller, measurements in this wavelength range are not presently 
possible. The significant point is that the “dead layer” thickness is 
approximately a micron and S is very high. 

In order to gain insight into the significance of the “dead layer,” 
the surface potential was determined by surface conductivity meas- 
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Fig. 5— Conversion efficiency as function of wavelength for etched and HF- 
treated surfaces. Target is 4 mils thick. 


urements" on silicon slices from the same 10 ohm-cm n-type crystal. 
The surface was found to be in a condition of depletion with the po- 
tential varying from —0.25 V at about 14 h after etching to —0.6 V 
(energy bands bent upward) after several hours of aging in air. The 
variation of potential within the surface depletion region was obtained 
from Poisson’s equation’® for a donor concentration of 6.0 x 10% 
em. Because the electric field in the surface depletion region goes 
to zero very slowly it is difficult to define a depletion-region depth. 
If, however, the potential from Poisson’s equation is approximated 
by a simple parabolic potential of the form 


V(a) = (V./x2) (x a= els (9) 


where V, is the surface potential, then the depletion layer thickness may 
be approximated by x, . The quantity x, is determined by a reasonable 
fit from x & 0.1z, to x & 0.8x,. For a surface potential of —0.25 V, 
x, 18 0.8 » and for —0.60 V, xz, is about 1.1 yw. Therefore, the depletion 
layer thickness is about the same as the “dead layer.” 

Because the depth of the “dead layer” and the surface depletion 
region are about the same, it is reasonable to attribute the “dead 
layer” to the surface depletion region. This assumption suggests that 
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the “dead layer” results from the built-in field at the surface whose 
direction is such as to oppose the diffusion of holes to the junction. 
To test this hypothesis, reversal of the surface field should eliminate 
the “dead layer” because the field for holes would be in the same 
direction as the diffusion toward the junction. It has been shown** 25 
that a hydrofluoric acid treatment bends the energy bands downward 
at a silicon surface temporarily. In the present work a surface poten- 
tial of +0.2 V was determined after such treatment. The upper curve 
in Fig. 5 shows that the experimental data for an HI'-treated surface 
may be fitted with the same L,, but no “dead layer” is required. Aging 
in air causes a shift in surface potential back to a depletion condition 
and a response that requires a ‘dead layer” correction. 

Although a 4-mil target thickness is useful in efficiency studies for 
discriminating among the critical parameters, a target for a camera 
tube should be <1 mil thick for adequate resolution. For a 1 mil thick- 
ness Fig. 6 shows two sets of experimental data together with calculated 
curves which illustrate the importance of S and 6. Consider first the 
experimental data for the etched surface (the dots). The L, of 32 u 
was obtained from the 7, measurement at a thickness of 4 mils before 
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Fig. 6— Influence of surface-recombination velocity on efficiency for 1-mil 
thick target. 
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the target was thinned to 1 mil. The values of S = 10’ em per second 
and 6 = 0.8 » were previously shown to be representative of an etched 
surface. 

The dashed lines are calculated for S = 10’ cm per second and 
6 = 0 or 0.8 pu to illustrate the effect of the ‘‘dead layer” at this thick- 
ness. It may be seen that the efficiency in the visible region drops from 
10 percent at 0.7 uw to 1 percent for high S and no “dead layer,” but 
drops to 0.1 percent for the 0.8 uw ‘dead layer.” Calculations based 
on equation (8) show that when d S JL,, further increase in ZL, does 
not improve efficiency very much. This also means that efficiency is 
insensitive to small lateral variations in d which may occur during 
etching. However, S and 6 have a strong influence on efficiency and 
these parameters must be substantially decreased. The HF treatment 
and other chemical treatments*'’® on the etched surface which reduce 
S and 6 are too unstable for long life in a camera tube. 

A more permanent improvement in efficiency can be made by high 
temperature oxidation of the surface. Experimental efficiency data 
taken after a light steam oxidation (900°C for 10 minutes) are also 
shown in Fig. 6 (circles). Oxidation raised efficiency above 10 per- 
cent at the blue end of the visible region. Values of S = 3 X 10* cm 
per second and $ = 0 yield a calculated efficiency curve which fits the 
experimental points. Values of S for steam oxide with no further treat- 
ment have been as low as 10° cm per second which gives an efficiency 
of 50 to 60 percent. The 800 A oxide also serves as an antireflection 
coating which contributes an additional slight increase in efficiency. 

Experimentally determined reflectivity of the oxidized surface was 
used in equation (8) to obtain the solid curves shown in Fig. 6. No 
“dead layer” correction was needed for the oxidized surface. This is 
attributed to the fact that the oxidized surface is more n-type than 
the bulk; the energy bands are bent downward at the surface, elimi- 
nating the depletion layer. The HF soak eliminated the “dead layer” 
temporarily for the same reason, that is, it bends the bands down at 
the surface. 

It is assumed that oxidation reduces S both by reducing the density 
of recombination centers and by shifting surface potential, although 
the data to confirm this are not complete. Surface potential is shifted 
from about —0.6 V, the previously mentioned depletion-layer con- 
dition for an etched surface, to +0.2 V (bands bent downward). Fast 
state density is not known for the etched surface but is 3 X 1072 em” 
eV- for the oxidized surface. The S values of 10° to 3 X 10* agree 
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reasonably well with values for oxidized surfaces reported by Rosier?” 
but not with the 5 to 10 cm per second values reported by Grove and 
Fitzgerald.18 However, approximate agreement with the Grove and 
Fitzgerald values is obtained for an oxide which has been through 
_boron and phosphorus diffusions, as discussed in Section 4.2. 

Oxidation can thus provide satisfactory efficiency, but there are 
two rather serious objections to oxidation for this application: 


(t) The oxidation must be done after the target is otherwise com- 
plete, and at this stage it frequently causes an increase in diode leak- 
age current. 

(72) Vacuum bake-out of the tube and target at 350°C, which is 
desirable for good tube processing, may increase the interface state 
density and the recombination velocity. 


Although both of these problems could be overcome by improved 
control of the oxide, an alternative procedure was found which ob- 
viated the final oxidation step for low S. Figure 7 shows the improve- 
ment in efficiency caused by a phosphorus diffusion on the light- 
receiving surface of a 4-mil thick target (circles). The phosphorus 
was diffused at 925°C for 10 min in PBrs vapor, yielding a depth of 
about 0.4 ». This reduced S to a nominal value of 50 cm per second 
and the diffusion length was increased to 52 yp», yielding an efficiency 
of about 20 percent. The efficiency has been 50 to 60 percent on 1-mil 
thick targets. 

Equation (8) is rather insensitive to surface-recombination velocity 
for S S 200 cm per second, and 50 cm per second is only given as an 
approximate value. The phosphorus data in Fig. 7 were obtained with 
the phosphate glass on the surface. Removal of the glass did not change 
the response. When the diffused phosphorus layer was removed, the 
efficiency dropped to the original level for an etched surface, except 
for an upward displacement at long wavelengths resulting from the 
improved bulk lifetime. In some cases Z, has been increased to 100 yu 
(r = 10 usec). The effect on S is evidently due to the built-in field of 
the diffused phosphorus layer which repels holes from the surface 
and causes a low recombination velocity. The principle of surface 
doping to reduce S was proposed by Moore and Webster” but we are 
not aware that a demonstration of it has been published. The slight 
droop in the phosphorus curve at about 0.5 u, requiring a “‘dead layer’ 
correction of 0.1 4, seems to result from diffusion damage. At present 
the best conditions that have been found to minimize this effect are 
850°C for 30 minutes. 


1842 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 





























= d=4 MIL 


@ ETCHED 











O PHOSPHORUS 


® DIFFUSED 
0.2 rae 
* s=107 cm/sec 











40 
20 
a S =50 cM/SEC 
ra 6 =0.11 
Go: AO ee 
c 8 
a 6 
2 4 
6 
ts 
8 
6) 
Zz 
w 
3. 
nw 0.8 
ue 
uw 
z 
e) 
Yn 
a 
UB) 
> 
z 
) 
O 











0.1 6=08p 
0.06 
0.04 
0.4 0.5 0.6 0.7 0.8 0.9 1.0 


WAVELENGTH, A, IN MICRONS 


Fig. 7 — Effect of phosphorus diffusion on efficiency. Lower curve shows con- 
dition with etched surface before diffusion. 


These results were all obtained on targets with large diodes, out- 
side the tube. Results for a silicon target in a tube are shown in Fig. 
8 where it is compared with a standard vidicon, and with the ideal 
case of unity conversion efficiency. Efficiency is expressed in units 
commonly used for vidicons, microamps of output current per micro- 
watt of incident radiation. The curve for unity efficiency slopes 
upward because at longer wavelengths there are more photons per sec- 
ond per microwatt of radiation, with each photon capable of excit- 
ing an electron-hole pair. Efficiency of the silicon target is more than 
twice that of the vidicon in the middle of the visible region, and the 
sensitive range is much broader. The conversion efficiency exceeds 50 
percent at a wavelength of 0.7 p. 

The diffused layer of phosphorus provides a very stable reduction 
in S which is unaffected by vacuum bake-out or deposition of anti- 
reflection coatings. Furthermore, the phosphorus treatment does not 
harm the diode characteristics; instead, it improves them as discussed 
in the next section. The phosphate glass must be removed from the 
p-type islands by a brief etch, but this requires no remasking. 
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IV. DARK CURRENT 


4.1 Bulk and Surface Generated Current 


Low dark current is another important requirement for this target. 
Current, in the absence of illumination, should not exceed 1 x 10° 
A per diode at 5 to 10 V reverse bias, or about 50 x 10-° A for the 
whole array. Currents slightly above this range reduce the dynamic 
range of the camera tube or the available picture contrast. Currents 
five times greater prevent integration of the incident light flux over a 
full television scan period. The most readily observable effect of ex- 
cessive dark current (more than 100 x 10° A) is “whiting out” of the 
picture on the display tube. 

The dark current required for good performance is considerably 
lower than for most silicon devices. The reverse current in a target 
array can be separated into the two general categories of bulk gen- 
erated current and surface generated current. An estimate of the bulk 
current can be obtained from the expression by Sah, Noyce, and 
Shockley®° for current generated in the space-charge region: 


vo ns = 
| paun,{2(r1, * gosh E - + 3In (=) } ; (10) 


In equation (10) w is the depletion width, n; the intrinsic carrier con- 
centration, r, and +r, the hole and electron minority carrier lifetimes 
on their respective sides of the junction and E, — E; the energy dif- 
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Fig. 8— Comparison of 1 mil silicon target with antimony trisulfide target. 


1844 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 


ference between an assumed single recombination-generation level 
and the intrinsic Fermi level. For an 8 » diode at 10 V bias with the 
common simplifying assignment of H; = H; and +r, = 0.1 psec > tn as 
lifetime values representative of diffused structures, the current would 
be predicted by equation (10) as approximately 3 x 107? A. Since 
this estimate of bulk current alone exceeds the permissible dark cur- 
rent, diode leakage current was regarded as a potential source of dif- 
ficulty for this device. 

Of several diffusion conditions tried, those which yielded the lowest 
bulk generated reverse currents were 1140°C for 20 minutes with a 
BBrs source. These conditions gave a sheet resistance of 5 to 6 ohms 
per square and a junction depth of 2 ». A subsequent phosphorus dif- 
fusion, which is required for ohmic contact and low S on the light- 
receiving surface, is an important part of the process. As a result of 
this treatment, the diffusion length was typically improved by a factor 
of three, while the reverse current, as measured on the 5-mil test 
diodes on the actual targets, was reduced by an order of magnitude to 
a median value of 107? A. In addition, the exponent in I « V” was 
reduced from ~1.0 to ~0.5. 

The improvement in both the diffusion length and dark current by 
the phosphorus diffusion is presumably caused by a gettering action 
by the phosphate glass on impurities, such as gold and copper. A get- 
tering effect has been suggested in several reports in which phosphorus 
treatments have improved silicon diode reverse characteristics**~* or 
minority carrier lifetime.?*?5 The improvements in reverse current 
already described are quite similar to those reported by Ing and his co- 
workers about p*n diodes of 0.92-cm material.?? In our work, neutron- 
activation analysis showed that a boron diffusion increased the gold 
concentration from approximately 4 x 10% cm? to about 2 x 10% 
em-°, while the phosphorus diffusion reduced it again to 4 X 10*? cm". 
Insufficient sensitivity obscured any similar effect on copper if it was 
present. Cleaning the substrate with nitric acid or aqua regia before 
diffusion gave better post-boron I-V characteristics than did cleaning 
treatments without a strong oxidizing acid. The phosphorus treat- 
ment then caused further improvement. 

The above process was developed using as control information the 
results of efficiency and dark current measurements on 5-mil test 
diodes available on target arrays, and also on the graduated-diameter 
diodes shown in Fig. 3. In addition to their function as control speci- 
mens, the graduated-diameter diodes were used to establish the rela- 
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tive importance of bulk and surface currents. This designation is made 
by assuming the total current to be a linear combination of an area 
and perimeter component: 


I, =Ig+I>e = @D’ + oD, (11) 


where D is the diode diameter. It was possible to estimate @ and @ 
graphically from a plot of I,/D vs D for the diodes of graduated diameter, 
and then calculate the J, and J, components for a diode of a particular 
diameter. In a typical case for a 5 mil diode, Ig was 4 X 10°” A and 
Ip was 8 X 10°* A. This corresponds to a bulk current density of 
3.15 X 107° A per cm’, and a perimeter current density of 2 X 10°" 
A per em. The relative importance of IJ», becomes greater as the diode 
size decreases. Extrapolating to an 8 pu diode, Jg = 1.5 X 10°" A 
and Iz = 5 X 107%* A. Total dark currents of targets in tubes have 
been 5 to 50 X 10°° A at 5 to 10 V or 2 to 20 X 107” A per diode, 
thus falling closer to the perimeter dependent limit. 

The larger area diodes, which minimize the contribution of the 
surface current, permit assignment of HZ; — H; in equation (10). In 
Fig. 9 the I-V characteristic of a typical 25 mil diode is shown with 
curves predicted by the Sah-Noyce-Shockley theory for different 
values of H; — E;. The reverse current density at 10 V for this diode 
was 6 X 10°° A per cm?. The lifetime 7, of 5 psec was obtained from 
conversion efficiency measurements. Also, 7, was taken as much 
greater than +, and the argument of the cosh was greater than unity. 
The depletion-region width in equation (10) was obtained from ex- 
perimental capacitance-voltage data and the expression w = « A/C. 
It may be seen in Fig. 9 that the reverse current calculated for the 
given 5 psec lifetime and the single recombination center at about 
an EF; — E; of 0.08 to 0.1 eV matches the experimental I-V charac- 
teristic. 

A quantity called the effective lifetime reg, which is the lifetime 
obtained from equation (10) on the assumptions that H; = H; and ‘ers 
= tp = Tr, may be used as a figure of merit to compare low leakage 
diodes of different resistivities. For our diodes, typical values of rere = 
100 psec were obtained. Ing and his co-workers obtained tere between 
10 and 40 psec for their gettered diodes just described,?? and Sah cited 
& rere Of 28 psec for a high lifetime diode.?® 

Other measurements on our large area diodes also suggest the re- 
verse current is dominated by bulk generation current within the 
space-charge region. Inversion layer surface leakage is not observed 
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Vig. 9— Reverse characteristic of 25 mil diode (circles) compared with char- 
acteristics predicted by Sah-Noyee-Shockley theory for different energy levels of 
recombination centers (dashed lines). 


in these diodes, nor is it expected, since charge at the interface is 
positive and induces an n-type accumulation layer. The activation 
energy of reverse leakage current has been measured as 0.5 to 0.6 eV 
in the range 0 to 30°C. Such an activation energy is compatible with 
recombination-generation current rather than inversion layer (chan- 
nel) leakage or with bulk diffusion current.2* In addition, the for- 
ward I-V measurements are characterized by 1 < m < 1.3 in the ex- 
pression J = I, exp(qV/mkT). Sah has found such a value of m for 
recombination-generation current.?° 

The foregoing observations were on relatively large diodes measured 
with no field applied across the passivating oxide. These results are 
all reasonably consistent with a model of reverse leakage current 
dominated by generation in the space-charge region of the metallurgi- 
cal junction, and the currents are satisfactorily low when proper dif- 
fusion conditions are used. 


4.2 Leakage Induced by Electron-Beam Charging of the Oxide 


The satisfactory behavior observed for large diodes is a necessary 
condition if good arrays of small diodes are to be obtained. How- 
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ever, it may not be sufficient if the field induced by electron beam 
charging of the oxide leads to high surface-generated currents. In fact, 
the reverse characteristics of 8 » diodes in target arrays, with a field 
across the oxide, are more complicated, although currents are still low 
enough at 5 to 7 V. 

This section shows how surface generation current is influenced by 
a field across the oxide on the junction side of the array. At present, 
the problem of achieving low dark current in silicon target arrays is 
to reduce this type of leakage current. However, if improvement ten 
times better than present results is achieved, bulk generated reverse 
current will again become a problem. 

The electron beam charging of the oxide can be simulated outside 
the tube by evaporating gold dots over both the oxide and p-regions 
and applying a negative bias. This gold dot structure then represents, 
in most respects, the situation in the camera tube, where the electron 
beam falls on both the diodes and the oxide. In Fig. 10 the experimen- 
tal I-V curves are compared with the maximum allowable current of 
50 X 10° A and the characteristic obtained for bulk generated current 
only. The bulk generated current curve is based on the assumption 
that the entire 1 X 10° A of reverse current for a 5-mil test diode is 
bulk current. 

The top curve (data points given by A) of Fig. 10 is a reverse 
characteristic measured by applying a negative bias to a 25-mil gold 
dot which covered 790 of the 8 » diodes in a test array as illustrated 
in Fig. 2. The current was scaled up to 435,600 diodes. 

The curve shown by the circles is a dark current characteristic 
measured in a tube. This curve shows behavior similar to the gold 
dot characteristic. However, the tube characteristics sometimes do 
not flatten out, for reasons not yet fully understood, although a few 
leaky diodes in the array are responsible in some cases. It may be 
seen that for these two curves the current rises steeply with increasing 
voltage and then changes slope abruptly between 6 and 10 V. Follow- 
ing the model of Grove and Fitzgerald,’ this behavior can be de- 
scribed as current generated by interface states. This current in- 
creases, and finally saturates as a depletion layer is induced under 
the oxide. In specimens with resistivity more than about 8 Q-cm, this 
depletion layer may result from the merging under the oxide of the 
space-charge regions of the 8 » diodes, which are less than 12 » apart 
edge to edge. 

However, curves like those of Fig. 10 are also observed on targets 
with substrate resistivity as low as 0.1Q-cm. In this case, the deple- 
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Fig. 10— Total diode reverse current as a function of bias for target arrays. 
Current measured by contacting gold dots over diodes and oxide or as dark 
current in tube. Gold dot measurements cover 790 diodes and are extrapolated 
to 435,600 diodes. Dashed line would result from bulk space-charge generation 
alone. 5 X 10°8 amperes is the approximate upper limit for good performance. 
tion is induced by a field normal to the semiconductor surface result- 
ing from the negative bias on the gold dot, as in the gate controlled 
diodes of Grove and Fitzgerald.*® In either case, the entire silicon sur- 
face between diodes is in a depleted condition. Every surface recom- 
bination-generation center on that part of the target surface not ac- 
tually occupied by a diode contributes to the reverse current. 

After depletion is established, the subsequent slower increase in 
reverse current beyond the discontinuity in Fig. 10 is caused by deep- 
ening of the space-charge regions under the oxide and around the 
metallurgical junctions. No decrease in current occurs at higher volt- 
age across the oxide, as observed by Grove and Fitzgerald,1® since in 
our case the potential on the oxide equals that on the p-region. Under 
these circumstances, inversion cannot occur to isolate the surface 
states from the depletion region.1® Grove and Fitzgerald relate sur- 
face generation current to the surface-recombination velocity s, by 
the equation: 


I = qnjs. (12) 
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Using the current at the discontinuity s, is estimated as 24 cm per 
second. 

The density D,, and electron capture cross section o; of interface 
states on a similar oxide, which had been through the same diffusion 
process, were measured by the MIS conductance technique of Nicol- 
lian and Goetzberger.*® D,, was 10't cm~? eV-? near the center of the 
energy gap and o, was 2 X 107° cm”. Using Grove and Fitzgerald’s"® 
definition of surface-recombination velocity for a depleted surface: 


So = o8v,,0kTD,, , (13) 


So is 16 cm per second with v;, as 107 em per second. This value is in 
reasonable agreement with the value obtained from the generation 
current. In addition, conversion-efficiency measurements on a similar 
oxide, that is, one exposed to diffusion conditions, yielded surface- 
recombination values of about 50 cm per second. Notice, however, 
that the diode leakage and efficiency pertain to different values of 
surface potential. 

The current at the discontinuity in gold dot I-V curves has been 
reduced by two methods. The upper curve in Fig. 10 for a (111) sur- 
face can be lowered to the solid triangle curve by hydrogen annealing 
at 500°C. The decrease in current by an order of magnitude at the 
discontinuity is assumed to result from a reduction in fast state den- 
sity expected from this treatment.?°°? A (100) silicon surface with- 
out hydrogen anneal produced the curve shown by squares. The cur- 
rent at the discontinuity has been lowered to 5.5 x 10° A at 2.5 V, 
presumably because of a reduction in interface state density.?® 5? This 
current corresponds to an s, of 1.6 cm per second. Thus, significant 
decreases in the leakage current have been made by these simple 
changes in processing and further improvement can reasonably be 
expected. 


V. DEFECTS 


A third very important requirement of any camera-tube target is 
freedom from defects. For the silicon target this means near perfec- 
tion in an array of nearly 14 million diodes and the demands on 
planar technology are obviously severe. Leaky diodes, for example, 
can cause bright spot defects, while diodes which are covered and 
cannot be contacted by the electron beam, cause dark spot defects. 
Pinholes in the passivating oxide may cause bright spots by allowing 
the electron beam to contact the substrate directly. Certain dark 
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features have been identified with dislocation arrays revealed by etch 
pits in a neighboring slice of the crystal. 

Not all defects can be explained at present. Figure 11 shows some 
which have been identified. They were intentionally introduced dur- 
ing the target processing. The picture at the top was taken with the 
camera tube viewing a transparency illuminated by tungsten light. 
Below it are photomicrographs taken of small areas of the target after 
it was removed from the tube. The dark area in the display corre- 
sponds to a spot in the array where oxide holes are missing because 
high spots in the oxide lifted the mask and allowed exposure of the 
photoresist over that area. The bright spot on the left corresponds to 
a large hole, revealed in the photomicrographs, which was etched in 
the oxide because contamination on the mask prevented exposure of 
the photoresist and, therefore, a hole was etched in the oxide. Boron 
diffused into that entire area and produced a large leaky diode. In ad- 





Fig. 11— Example of gross defects in target. Top: picture from TV monitor. 
Lower right: photomicrograph of target showing spot where oxide holes are miss- 
ing, corresponding to dark spot in display. Lower left: photomicrograph showing 
large hole in oxide which corresponds to bright spot in picture. 
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Fig. 12 — Picture taken with relatively defect-free target. 


dition to defects associated with planar technology steps of target 
fabrication, bright spots can also be introduced by the deposition of 
the semi-insulating film for dissipating charge from the area between 
diodes. 

The defects of Fig. 11 were introduced deliberately. Figure 12 
shows a picture with only a few small, unintentional, defects visible. 
We have not yet made a target entirely free of defects. Substantially 
ereater improvement in the defect situation has been made by the 
eroup at Bell Telephone Laboratories, Reading, Pas 


VI. SUMMARY AND CONCLUSIONS 


The status of the three important factors efficiency, diode leakage 
current, and defects in the array may be summarized as follows. 


(i) Satisfactory conversion efficiency has been achieved. Surface 
recombination velocity (S) at the illuminated surface is the dominant 
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parameter controlling efficiency, and it should be $10° cm per second. 
Sufficiently low S can be obtained by wet chemical treatments or by 
oxidation in steam, but the most reliable treatment has been a phos- 
phorus diffusion. This provides a built-in field which repels minority 
holes from the surface. S values of 50 cm per second, and efficiency 
of 40 to 60 percent throughout the visible region, have been obtained 
with this treatment. Adequate diffusion length, 30 to 100 u (r = 1 to 
10 usec) has also been obtained. 

(47) Diode leakage current is low enough ($1 X 10°” A per diode 
or $50 X 10°° A total dark current) for satisfactory operation at low 
target voltage (S7 V). Bulk generated current of <1 X 10°° A 
per cm’ at 10 V is observed in large diodes (5 to 40 mil diameter). 
This would yield 4 X 107° A for an 8 » diode. However, surface genera- 
tion complicates the behavior when a field is applied across the pas- 
sivating oxide as under a gold dot or in electron beam scanning. This 
causes an initial steep rise in current followed, usually, by an abrupt 
decrease in slope at 6 to 10 V. The current at which this occurs has 
been lowered substantially by use of (100) instead of (111) silicon slices 
and by hydrogen annealing. 

(ui) Present technology produces targets which are reasonably 
defect-free but processing must be improved to eliminate defects com- 
pletely. Leaky diodes or groups of diodes and oxide pinholes cause 
bright spots, while diodes which are covered and cannot be contacted 
by the electron beam cause dark spots. A decrease in average dark 
current should reduce the ability to observe fluctuations from diode 
to diode. 


The emphasis on these three factors is not intended to imply that 
they are the only critical problems. Two others, discharging of the 
passivating oxide and resolution, have been studied by Crowell and 
Labuda.? 
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A Charge Storage Target for Electron 


Image Sensing 


By EUGENE I. GORDON and MERTON H. CROWELL 
(Manuscript received April 12, 1968) 


A charge storage target consisting of a dense array of silicon photodiodes 
has been described as the image-sensing element in a vidicon type of camera 
tube for the Picturephone” station set. The target stores a spatially distrib- 
uled charge pattern corresponding to an optical zmage in the form of a 
partial discharge of the reverse-bias voltage of the diodes. The discharge 
results from leakage current associated with hole-electron patrs created in the 
silicon substrate by incident photons during the raster interval. Recharging 
of the diodes to the full reverse-bias voltage along a prescribed raster by the 
scanning, low energy, electron beam creates the desired video signal. 

This paper describes creation of the hole-electron patrs in the silicon 
substrate by impinging high energy electrons. Since these electrons, incident 
from the side opposite the diode array, create a multiplicity of pairs, charge 
gain results. As in photon sensing, the discreteness of the array allows 
preservation of detail in the spatial distribution of impinging electrons. 
ALTeasurements of charge gain as a function of electron energy and target 
resolution are presented. 

Applications in scan conversion, low light level TV, X-ray image 
intensification, and electron microscopy are indicated. 


I. INTRODUCTION 


The subject of this paper is the use and properties of a self-sup- 
porting silicon wafer containing an array of about one-half million 
diodes in an area of 12.5 millimeters on a side. See Fig. 1. The thick- 
ness of the substrate under the diode array is in the range 10-25 microns 
depending on the application. The wafer perimeter which is consider- 
ably thicker, provides increased physical strength. As an image sensing 
target in a vidicon type of camera tube, developed for the Picture- 
phone® station set,? it converts incoming photons that are absorbed 
in the n-type conductivity substrate into hole-electron pairs. 
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Fig. 1— Cross section of active target area illustrating the substrate, diodes, 
oxide mask, resistive sea, and scanning electron beam. 


Except for those holes created within a surface layer, a few thou- 
sand angstroms thick opposite the diodes, there is virtually unity 
probability that any minority carrier hole can diffuse to the deple- 
tion region of one of the normally reverse-biased diodes. The hole is 
swept across the depletion region to the diode p-type conductivity 
region and contributes to the total leakage current of the diode. The 
totality of holes reaching the diode during a raster interval, partially 
discharging the diode, constitute a stored charge proportional to the 
integrated local intensity of the photon flux. Recharging of the diode 
by a scanning, low energy electron beam creates a current in an ex- 
ternal circuit which constitutes the desired video signal. The re- 
charged diode is primed for integration of the hole flux during the 
next interval by the same process. Figure 2 illustrates the performance 
on one such target illuminated by a conventional TV test pattern and 
scanned in a 525-line raster with a frame interval of 1/30 second. 
Other forms of radiation will create hole-electron pairs in a silicon 
substrate. Figure 2 therefore indicates the potential performance for 
imaging these as well. 

In this paper the radiation of interest is energetic electrons. An 
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energetic electron, impinging on silicon will create 286 hole-electron 
pairs per kiloelectron volt of kinetic energy. When the holes can dif- 
fuse to the nearest diodes with high probability, the resulting charge 
exceeds the charge incident on the target and amplification results. 
Thus the target, in conjunction with scanning, video processing, and 
display is an electron image transducer. It has the potential of being 
useful in any one of a large number of systems or devices in which it 
is desired to convert spatial intensity variations in incident radiation 
into a visible image. For example, image intensifiers transduce an 
optical image into an equivalent electron image by absorbing the in- 
cident light on a large area photocathode. The resulting low energy 
electron image is refocused at high energy onto a second plane by an 
appropriate electron-optical system. A phosphor screen transducer 
placed in this plane produces an intensified optical image. The elec- 
tron image in an electron microscope similarly is viewed by a phosphor 
screen transducer. The addition of a transducer for X-rays to light 





Fig. 2— Monitor display illustrating performance of diode array target in 
a camera tube. 
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at the front end of an image intensifier allows X-ray image intensifica- 
tion. The transducer would be a phosphor screen photon-coupled to 
the photo-cathode of the image intensifier. 

In all of these applications it is becoming common practice to view 
the output phosphor sereen with a closed circuit TV system. This 
allows observation of a magnified, bright image with the ability to 
perform video processing and to produce a permanent record on video 
tape. In these cases, especially when a direct visual output is not 
really necessary, a single stage clectron-to-video transducer with 
charge amplification replacing the output phosphor screen, lens, and 
pickup tube would effect a considerable simplification in the system 
with savings in volume and cost and would afford the possibility of 
better performance. 

Indeed in a class of pickup tubes exemplified by the secondary 
election conduction (SEC) camera tube,t the fundamental image 
sensor is a photocathode, and the electron image is focused at high 
energy onto a charge storage target with charge gain. The target is 
scanned in a vidicon fashion to produce the video output. In this case 
the target is an insulator, KCl, evaporated onto a thin metallic back- 
plate. The charge in this case is created in a region of high electric 
field, and the resulting electrons are swept out. The remaining posi- 
tive charge is immobile and constitutes the storage mechanism. Such 
a target is similarly an electron-to-video transducer with charge gain 
and has been used in the applications suggested above. 

In a second class of applications exemplified by double beam 
storage tubes, Fig. 3, the input electrons are produced by a writing 
gun and form an amplified, stored charge pattern. The stored pattern 
could represent, for example, a video display, as in a scan conversion 
device,> an oscilloscope trace for highspeed, nonrepetitive events® or 
a closed, nonintersecting path for variable delay of analog or digital 
signals.” 

The double-beam device of Fig. 8 has been chosen as the vehicle for 
study of the target imaging characteristics under electron bombard- 
ment. The target has also been studied under conditions which allow 
multiple readout of the stored charge. In what follows the double- 
beam device will be referred to as a scan converter. 


II. THE SCAN CONVERTER 


By way of introduction and for comparison, it is worthwhile to 
review some aspects of the target optimized for use in a camera tube. 
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Fig. 3— Scan converter. Notice back-to-back gun structuré. 


The array of reverse-biased diodes stores positive charge, created in 
the form of minority carrier holes by photons incident on the n-type 
silicon substrate from the side opposite the diode array. The holes 
diffuse to the diode depletion region and are swept across to the p- 
region or island of the diode. The stored charge is manifest as a partial 
discharging of the diodes from their full reverse-bias voltage, which 
equals the fixed potential of the target substrate, Vr, relative to the 
potential of the electron beam cathode. The scanning, low energy 
electron beam, landing on the exposed p-islands, periodically charges 
them toward cathode potential. Full recharging brings the potential 
of the p-islands down to cathode potential at which point the elec- 
trons can no longer land on the surface. This reestablishes the full 
reverse bias across the diodes. 

Since the substrate potential is held fixed, the p-islands of partially 
discharged diodes exhibit a positive potential variation on the sur- 
face facing the electron beam. These islands are charged back to 
cathode potential on the next pass of the scanning electron beam. 
The recharging current constitutes the desired video signal and is 
proportional to the number of holes collected by the diodes at the posi- 
tion of the scanning beam. Since the number of holes stored by these 
diodes is proportional to the number of photons incident during the 
preceding frame period, the video current measures the integrated 
light intensity at the position of the diodes. The use of a discrete diode 
array preserves the spatial integrity of the incident light pattern to 
the extent that lateral diffusion of the holes is negligible and spatial 
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frequencies with periods comparable to or smaller than the diode 
spacing are not important. 

In the design of the camera tube target the proper choice of diode 
capacity is important as can be seen by the following example. Sup- 
pose that the peak video signal is set at 200 nanoamperes and the 
scanned area has 670,000 diodes. The scan time is approximately 
1/30 second. The maximum charge restored to the diodes during 
scanning is 2 X 10-7 amperes X 1/30 second = 6.7 X 10-° coulombs 
or 10-** coulombs per diode. The voltage swing of a p-island should 
never be more than about 5 volts since the beam may be pulled, 
producing landing errors for greater values. Thus the minimum re- 
quired diode capacity is approximately 10-** coulombs/5 volts = 
2 X 10 farads. 

Under the conditions specified, the scanning beam current required 
to recharge the diode to a major fraction of the full reverse-bias 
voltage (during the submicrosecond interval the beam is incident on 
the diode) is 1 to 2 wamperes. Beam currents below this value lead 
to image lag resulting from incomplete recharging of the diode and 
reduced video signal levels. If the diode capacity is doubled relative 
to the minimum value, keeping everything else fixed and neglecting 
the dependence of capacitance upon reverse-bias voltage, the voltage 
swing of the diode is halved. 

The beam current required to recharge the diode to the same ex- 
tent as in the previous case is increased significantly, possibly more 
than a factor of two, because the beam landing efficiency is a strong 
function of the landing energy and is significantly reduced if smaller 
voltage swings are used. (The beam landing efficiency is defined as 
the ratio of the surface charging current to the incident current. It 
is less than unity because of secondary emission and elastic reflection 
of electrons.) Large beam currents are not desirable and in general 
not practical; hence the diode capacity must be critically controlled. 

For the diode geometry used in the camera tube the silicon resis- 
tivity to achieve the appropriate capacitance range is about 10 Q-cm. 
The optimum capacity may be achieved by adjusting the potential 
of the target substrate which varies the full reverse-bias voltage. 

A major requirement on the diode performance is the ability to 
sustain the reverse bias for an interval that is long compared with 
the scanning interval. With a diode dark current of 10°* amperes, 
the time for the diode reverse bias to decay to less than half its 
original value without recharging is about one second, which is sig- 
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nificantly greater than the usual 1/30 second recharging interval. The 
ability to hold the diode leakage current well below 10-* amperes at 
room temperature over the necessary range of target voltage represents 
one of the major accomplishments of the target development pro- 
gram for the camera tube. Leakage current for the full array is usually 
well below 50 nanoamperes. A major requirement on the substrate is 
that volume and surface recombination rates of minority carrier holes 
be reduced to the point that a large fraction of the holes created by 
incident radiation can reach the diode depletion region. 

The scan converter device is based on the charge storage and elec- 
tron beam readout properties of the target. The ability to create 
hole-electron pairs in the target substrate by bombardment with 
energetic electrons forms the basis of the writing function. Writing is 
accomplished with a CRT type of electron beam, current modulated 
with the incoming video signal and incident on the side opposite the 
diode array (Fig. 3). Each incident electron creates a multiplicity 
of hole-electron pairs, some of which discharge the diodes, creating a 
pattern of stored charge just as in the camera tube. The charge stored 
in the diode array actually can be greater than the charge deposited 
by the incident writing beam, requiring however that the number 
of hole-electron pairs created per incident electron times the proba- 
bility of collection for the hole, be greater than unity. The ratio of 
stored charge to incident charge will be called the charge gain. 

Despite a possible difference in scanning rates, in equilibrium the 
current level of the video signal generated by the reading beam will 
be larger than the current in the writing beam by just the charge 
gain factor. (Application of the concept of charge conservation will 
indicate the validity of the statement.) On the other hand, the reading 
beam current is required to be greater than the video signal because 
the beam landing efficiency is substantially less than unity. Hence 
the writing beam will usually have much lower currents than the 
reading beam. In addition, the writing beam electrons will land with 
energies in the kiloelectron-volt range while the reading beam elec- 
trons will land with energies in the range 0-5 electron-volts. The 
result is that the writing beam may be much more finely focused 
than the reading beam. Since the penetration range of the writing 
beam electrons in the silicon substrate is normally under one micron, 
the resolution of the scan converter should be essential identical 
to that of the camera tube for very short wavelength light which 
is absorbed close to the surface. 
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Figure 3 indicates that some of the writing beam current could 
return to the writing beam cathode through the target resistor, 
creating an undesirable video signal. In actual fact, the potential 
of the mesh immediately preceding the target is adjusted to such 
a value that the effective secondary emission coefficient of the sur- 
face is almost exactly unity. This balance is achieved almost instan- 
taneously so there is virtually no net writing beam current to the 
target (but rather the current is to the mesh) and virtually no cross- 
talk. In case the precise secondary emission balance cannot be achieved 
uniformly over the target, the secondary emission reduction in writ- 
ing beam current to the target coupled with the charge gain for the 
desired video signal brings the crosstalk signal down to a tolerably 
low value. 

The writing beam is scanned at a rate appropriate to the incoming 
video signal. On the other hand, the reading beam may be scanned 
at any rate desirable. Scanning the reading beam more rapidly than 
the writing beam produces a multiplicity of time-compressed frames 
as would be required for slow scan TV. 

There is a technique for achieving multiple readouts while preserv- 
ing the full gray scale. During the early stages of the development 
of the camera tube, it became clear that it was difficult, if not im- 
possible, to obtain a stable video response unless the silicon dioxide 
(which covers the area between diodes and prevents beam electrons 
from landing on the substrate as well as protecting the diode junc- 
tions) was provided with a charge leakage path of some kind. With- 
out the charge leakage path it was not possible to control the surface 
potential of the oxide and a great many deleterious effects resulted. 

The scheme adopted to provide the leakage path has come to be 
known as “the resistive sea” and is simply the formation of a thin 
resistive film over the entire array. The sheet resistance of the film 
is chosen to allow surface leakage with a charging time constant 
of order one second which allows control without causing loss of 
resolution through lateral spreading. Additional constraints on the 
dielectric relaxation time and thickness of the film are required to 
allow complete charging of the diodes to avoid lag. These same 
parameters can be optimized to allow multiple readout. 

Consider Fig. 4 which illustrates the target with a resistive sea 
and some lumped circuit equivalents for the various parameters of 
interest. The diode has a capacity Cz. A pulsed current source 1, 
accounts for the partial discharging of the diode when the writing 
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Fig. 4 — Schematic of lumped circuit equivalent for diode and resistive sea. RpCy = 
pe <tr = 1/80; Ca/Cp « 1 low lag; Ca/Cp > 1 for multiple readouts. 


beam is incident on the substrate adjacent to the diode. Assuming a 
frame compression ratio of N is desired, the video signal must be 
created N times for each scan of the writing beam. To maintain 
adequate signal-to-noise ratio the video output current must be at 
a level comparable with that achieved in the camera tube. This re- 
quires N times as much charge storage; hence the diode capacity 
must be about N times larger than the equivalent capacity of the 
diode in the camera tube. The most direct way of achieving the 
increased capacity without changing the diode geometry is the use 
of N? times higher conductivity in the n-type silicon substrate than 
might be desirable for conventional camera, use. 

The thickness and resistivity of the resistive sea is arranged to 
allow negligible leakage from one diode to the next during the 1/30 
second between scans of the reading beam. Thus the lateral or spread- 
ing resistance of the film will be ignored as well as the shunt capacity. 
This considerably simplifies the discussion. The leakage resistance 
Ry, and film capacity C; must have an RC time constant much less 
than the 1/30 second between successive scans of the beam, yet long 
compared with the approximately 10-7 second or less that the reading 
beam is incident on the diode. For this particular geometry the RC 
time constant is about equal to the dielectric relaxation time constant, 
peep, Of the film material (p is the resistivity, e the dielectric constant 
and ¢«, the permeability of free space). Assuming e & 10 and choosing 
pee, & 5 X 10° second yields p 6 X 10° Q-cm. 

The ratio of diode capacity to film capacity Cy/C; should be about 
equal to N — 1, for reasons which will become clear shortly. Thus 
the film capacity should be about the same as that of the camera 
tube diode, 2 x 107° farads, requiring a film thickness of about 2 
microns over the 8 micron diameter diodes. The sheet resistance of 
the film is about 3 xX 107° ohms per square which for a 1/30 second 
frame is adequate to control surface charging without reducing 
resolution. 
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The substrate potential is held at Vz , the target bias supply voltage. 
In the absence of stored signal the reverse-bias voltage of the diode 
equals V; , the voltage across the film is zero, and the surface potential 
of the film equals the cathode potential, that is, zero. Reading beam 
electrons incident on the surface have a landing efficiency of essentially 
zero and no further negative charging of the surface can occur. 

Suppose now that writing beam electrons are incident on the substrate, 
producing hole-electron pairs and that a fixed fraction of these with total 
charge Q, diffuse to the diode, discharging the diode by a voltage 
AV, + Q./C.z . The charge Q, will be referred to as the signal charge and 
AV, as the signal voltage. It is assumed for the purpose of discussion 
that AV, < Vso that the diode capacity C, is constant and has a value 
appropriate to V; . In practice the film surface has negligible capacitance 
to ground so that no displacement current need flow through the film 
capacitance when the interface potential] rises and the voltage across the 
film therefore remains at zero. Thus the film surface is brought to a 
potential AV, . The maximum value of AY, is about 5 volts to avoid 
beam bending as in the camera tube. 

When the reading beam comes to the diode, the surface of the film is 
charged down essentially to cathode potential. Thus the series combina- 
tion of capacitors C, and C, is recharged by an amount AY, , requiring 
that the reading electron beam place a charge on the film surface 


AQ, = AV,/(U/C2 + 1/C)) 
Q./ + C./C)). 


(The reading beam current is set at a value high enough to provide 
the charge, AQ,, during the short reading time interval. During this 
interval, conduction current through the film is negligible compared 
with the displacement current.) The charge AQ, flows through the 
target resistor Ry; producing an output voltage proportional to the 
signal charge Q,. The signal charge stored in the diode capacitance, 
originally Q,, is reduced by the amount AQ, to a value 


Qi = Q./(L + C;/Cu). 


The voltage across the diode, originally Vp — AV,, is now Vr — 
AV,./(1 + C;/Ca). The voltage across the film is AV,/(1 + C;/Ca) 
which, because of the short RC time constant of the film, decays to 
zero before the next return of the reading beam. The surface poten- 
tial therefore achieves the value 
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AV, = AV./(1 + C;,/C2) 


as compared with to the original value AV, immediately preceding the 
first read. The process is then repeated. At the nth reading after the 
original signal charge Q, was established by the writing beam, the 
output voltage is proportional to 


AQ, = Q./( + C./C;)" 


and the film surface potential at the instant preceding the nth read is 


AV, = AV,/(1 + C,/Ci)". 


Since by design C,;/Cz is uniform over the target, the relative gray 
scale is preserved, the output signal is linearly proportional to the 
input signal and the output signal decays from one read to the next 
in a well-defined exponential fashion. The exponential time constant is 


T= 7,(1 ae Ci/C;) 


in which 7, is the reading frame time. Thus for a compression ratio of N 
an appropriate value might be C./C, = N — 1 which implies that the 
signal decays to 1/e in one writing frame time. For this case AQ, = Q,/N 
which establishes that the signal level is the same as that in the camera 
tube. Since AQ,/AV, = C,(N — 1)/N this establishes the correctness 
of the choice of C; about equal to the capacity of the camera tube diode 
(at least for N > 1). 

Incidentally, in the camera tube and in many other applications, 
it is desirable to read virtually all of the signal stored in the diode 
on the first read (that is, one wants N = 1). This is accomplished 
by making Cq/C; as small as possible. Thus the film thickness in a 
low lag target should always be under 0.1 micron as compared with 
2 microns in the scan converter example above. 


III. EXPERIMENTAL RESULTS 


Figure 5 shows the scan converter. The tube is two one-inch vidicon- 
type guns facing opposite sides of the diode array target. The read- 
ing gun has a close-spaced decelerating mesh, as required for good 
resolution, while the writing gun mesh has been spaced back about 
one inch. This space permits light to be directed onto the target for 
measurement of the collection efficiency for holes generated by photons 
as in a camera tube, 
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Fig. 5—Scan converter. The diameter of the glass envelope is one inch. The 
large spacing allows optical measurements to be made on the target. 


Most of the targets used to date are identical to those used in 
the camera tube; hence they have not been used for multiple readout. 
However, the validity of the capacitive subdivision of the available 
signal and the multiple read capability has been established with at 
least one target. 

The resolution of the scan converter target is measured as follows. 
Both the writing gun and the reading gun are normally operated 
with magnetic deflection and focus. The writing beam current is 
modulated sinusoidally in time at frequencies locked to the horizontal 
scan rate producing a fixed sinusoidal charge pattern on the array. 
Feed-through is eliminated by adjusting the potential of the writing 
beam mesh as described earlier. The reading beam scanning rate is 
locked to that of the writing beam to avoid fluctuations in the rela- 
tive number of reads per write. The reading beam scans over the 
fixed charge pattern producing a sinusoidal output signal. The meas- 
ured peak-to-peak amplitude of the signal normalized to the value 
measured at low spatial frequencies is called the contrast ratio or 
modulation transfer function (MTF). 

The MTF as a function of spatial frequency in cycles per inch 
of target is shown in Fig. 6 for a target with a substrate thickness 
of 20 microns and a diode spacing of 20 microns. Notice that the 
MTF is 55 per cent at 300 cycles per inch or 12 lines pairs per mm. 
The falloff may be attributed to four sources: (2) writing beam size, 
(it) reading beam size, (2) finite number of diodes, and (iv) lateral 
diffusion of holes. For the particular target the first two are least 
significant since under magnetic focus the reading beam is capable 
of resolving individual diodes and the writing beam has even greater 
resolving power. The third source of falloff may be appreciated by 
noticing that the linear diode density is 50 per mm, which means 
that there are about four diodes per spatial period at 12 cycles per 
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Fig. 6 — Measured modulation transfer function for the scan converter target. 


mm. This is barely enough and a part of the falloff may be attrib- 
uted to this fact. 

Most of the falloff results from lateral diffusion of the holes which 
under the conditions of the measurement are produced within 1/10 
micron of the substrate surface. Therefore, those that are collected 
must diffuse a distance of about 15-20 microns perpendicular to the 
surface. Since the lateral diffusion distance can be of the same order 
or greater, any detail requiring a spatial frequency of greater than 25 
cycles per mm is effectively destroyed. The precise nature of the MTF 
falloff for the target depends on the target thickness, the diode geom- 
etry, and the volume recombination length and surface recombination 
velocity for holes. We plan a detailed discussion for a future paper. 

The MTF at high spatial frequencies can be increased considerably 
by using thinner targets and increasing the diode density. Figure 2 
illustrates a target with a substrate thickness of 10 microns and diode 
spacing of 15 microns corresponding to 67 diodes per mm. For this 
target the MTF is 100 per cent out to 280 cycles per inch and falls to 
50 per cent at well over 400 cycles per inch. For this target the read- 
ing beam contributes substantially to the falloff in resolution. 

The effective charge gain, of course, is a function of spatial fre- 
quency and its relative dependence on spatial frequency is the MTF 
shown in Fig. 6. However, the absolute charge gain for uniform storage 
patterns is a parameter of importance. Large values of gain are not 
really required or desirable for the scan converter; values of order 10 
are useful. Values of order 10° or greater are desirable for some of 
the other applications. 
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The charge gain is measured in the following way. The unmodulated 
writing beam is scanned over some small area of the substrate at a 
current level sufficiently low that the resulting holes do not completely 
discharge the diodes. This can be checked by varying the writing beam 
current. The writing beam mesh is held negative relative to the sub- 
strate to suppress true low energy secondary electrons but not elasti- 
cally reflected primaries. The redistributed secondary electrons, which 
land at much reduced energies compared with the primary beam, pro- 
duce few, if any, hole-electron pairs. With the reading beam turned 
off, the current in the target lead measures the incident current in the 
writing beam that penetrates into the target. The charge gain in this 
case is effectively zero since the diodes are discharged and the hole- 
electron pairs are forced to recombine. The reading beam is turned on 
and scanned over an area which includes the area scanned by the writ- 
ing beam. The increase in time average current measured in the target 
lead measures the arrival rate at the scanned diodes of all the holes 
generated in the substrate. The thermal part of the hole generation 
(the diode dark current) is determined by turning off the writing beam. 
The net current divided by the writing beam current penetrating the 
target is called the charge gain. A preliminary discussion of the ex- 
pected results is appropriate at this point. 

The charge gain should be describable by the expression 


a) = [te SE” ae (1) 


in which (x) is the probability that the hole, created at a distance x 
from the surface upon which the electrons are incident, will reach the 
diode space charge region and be collected, dr(dP/dx) is the number 
of hole electron pairs created between x and x + dz for an electron 
incident normally with kinetic energy V electron volts, and L is the 
substrate thickness. The function 


_ [f° 2G, V) 
pv) = [| SS ae (2) 
which defines the total pair production per incident electron is given by 


PWV) & V/3.5 (3) 


(corresponding to the fact that it takes on the average 3.5 eV to create 
one hole-electron pair). Writing 


CHARGE STORAGE TARGET 1869 


G(V) = Pw) [ n(x) Ne ae/ | ay ar 


= P(V)n(V) 


(4) 


combines the effects of a variety of different phenomena into the function 
n(V) which will be called the effective collection efficiency. 

A number of different target fabrication procedures have been studied 
to optimize the effective collection efficiency. The best to date is similar 
to that used for optimizing the sensitivity of the target for visible light 
and consists of a thin n” layer formed on the writing beam side of the 
target. The n” Jayer is formed by a shallow phosphorous diffusion into 
the n-type conductivity substrate. The effect of this layer is discussed a 
few paragraphs further on. The measured collection efficiency n(V) = 
3.5G(V)/V as a function of electron energy is shown in Fig. 7. The 
collection efficiency approaches 0.5 for electron energies of order 10 KeV 
but falls well below 0.01 for energies below 2 KeV. Indeed in the energy 
range under 2 KeV the measured effective collection efficiency of a target 
for which the phosphorous diffusion was eliminated (the surface was 
merely etched) was higher at a constant value of 0.016. 

An understanding of the measurements requires a knowledge of 7(z). 
However, a theoretical] evaluation of n(V) is complicated by the fact that 
dP(x, V)/dz is not negligible very close to the surface and (x) near the 
surface is strongly dependent on the surface properties of the silicon 
crystal. Aside from the surface complication 7(v) may be accurately 


(3.5G/V) 





COLLECTION EFFICIENCY 
nv) 





ELECTRON ENERGY,V, IN KILO-ELECTRON VOLTS 


Fig. 7 — Measured effective collection efficiency for holes created by electrons of 
incident electron energy V. The effective collection efficiency n(V) is determined from 
the measured charge gain G(V) by the relation n(V) = 3.5G(V)/V. 
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evaluated on the basis of a model in which only the bulk recombination 
lifetime 7, the surface recombination velocity S at « = 0, the diffusion 
coefficient D, all for the minority carrier hole, and the substrate thickness 
L are relevant, 


_ sinh (2/1) + (D/IS) cosh (x/1) 
1(@) = sinh (L/1) - (D/IS) cosh (L/D 


1 = (Dv), 





(5) 


which follows from a one-dimensional Green’s function solution of the 
diffusion equation for excess holes. In fact (1) might be considered to be 
the normalized Green’s function solution for hole current through the 
plane at « = L. 

Quite generally (x) increases with increasing recombination length 1 
and the target fabrication is optimized to make I as large as possible. 
Typically 1 >> L and it is appropriate to make this assumption. Thus (5) 
becomes 


ne) SD (6) 


For an etched silicon surface S = 10°cm per second while D = 10 cm? 
per second. Thus for Z = 10 microns, which represents a minimum 
practical value, SL/D ~ 10°. It can be appreciated that (x) will be 
quite small for « < ZL unless S is substantially reduced. Thus for low 
beam energies corresponding to small penetration depths n(V) & 7(0) = 
(1 + SL/D)~* ~ 10°? which is consistent with measurements on etched 
targets. As mentioned above, the most relevant technique among those 
that have been tried to reduce S for this application is a shallow phos- 
phorous diffusion into the surface upon which the electrons are incident. 
This produces an n” layer which repels holes diffusing toward the surface 
resulting in an effective value of S = 10°cm per second. As a result the 
x-dependence in (6) is relatively small. Unfortunately the phosphorous 
diffusion drastically increases the recombination rate of holes in the n* 
layer and the layer can be characterized as dead. As a result (x) is not 
well known for very small zx. 

The uncertainty in interpretation of the experimental results intro- 
duced by the dead layer makes it desirable to study also the collection 
efficiency for holes produced by incident photons. For this case the initial 
distribution of holes created by the photons is accurately known. A corre- 
sponding effective collection efficiency function n(A) can be defined for 
pair production by photons of wavelength 4, | 
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i 
/ n(a)ae “* dx 


0 
fe) 
| ae ** dx 
0 


L 
= i n(x)ae °* dx 
0 


n(A) = 


(7) 


in which a(A) is the absorption coefficient for photons of wavelength i. 
This formulation is not strictly correct for aL < 1 since light penetrating 
to the surface at x = L may be reflected back into the substrate. Also 
the distance L is not well defined because of the diodes. If it is assumed 
that within the dead layer of thickness 6, n(x) = O and beyond the 
dead layer (x) is given by the equation in (6) with zx measured from 
the edge of the dead layer, then for aL > 1 


n(d) & | Apes Bide ede 
6 


n(d) Fess ees (8) 


The measured collection efficiency as a function of wavelength (for 
the same target used for obtaining the data of Fig. 7) is shown in Fig. 8. 
The data were obtained by admitting light onto the target through the 
glass wall of the tube envelope. The data are corrected for Fresnel 
reflection losses at the glass surfaces and from the silicon. Curves of (8) 
with S = 1.1 X 10* cm per second, D = 10 cm’ per second, L = 20 


\ 
‘NO DEAD LAYER 


S=1.1x104 
‘\\_ CORRECTED FOR CM PER SECOND 
DEAD LAYER 


COLLECTION EFFICIENCY 7 (A) 





0.3 0.4 0.5 0.6 0.7 0.8 0.9 
WAVELENGTH, A, IN MICRONS 


Fig. 8—-The measured collection efficiency for holes created by, photons of 
wavelength \. The solid and dashed curves were computed from equation (8). 
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microns and 6 = 0 and 0.21 microns are also shown. The calculation is 
not carried beyond \ = 0.65 microns since for Jonger wavelengths the 
target does not absorb all of the light transmitted past the surface. In 
fact, the slight falloff in collection efficiency above 0.7 microns may be 
attributed to this. Notice the good agreement with the curve correspond- 
ing to 6 = 0.21 microns indicating the approximate validity of the 
simple model of the dead layer. The dead layer thickness corresponds 
roughly to the phosphorous diffusion depth. 

In the range 0.4 < \ < 0.65 microns the absorption depth for photons® 
corresponds very roughly to the penetration depth for electrons in the 
energy range 4 < V < 10 KeV. The penetration depth increases 
monotonically with \ or V to a maximum value of a few microns. 
Although the distribution of created holes for any value of ) is not the 
same as the distribution for any value of V, it is not surprising that the 
range of measured values of n(A) and n(V) are quite similar. In either 
case a reduction of S to 10° em per second, which is the more typical 
value observed in camera tubes, would increase the collection efficiency 
to close to unity over most of the range. A shallower phosphorous 
diffusion would also improve matters for low V or d. 

To date the only feature of the target which causes some concern 
about its future applicability is a slight burn-in or aging phenomena 
associated with the writing beam. It shows up as a decrease in charge 
gain over very heavily scanned areas. The rate of aging increases with 
writing beam current. It is not yet known whether the aging effect 
saturates, or whether it even occurs at all for low writing beam cur- 
rents typical of most applications. The aging may account for the 
larger values of S observed in these targets as compared with camera 
tube targets.® 


IV. CONCLUSION 


A charge storage target for low energy scanning beam readout has 
been described with respect to its ability to produce a video represen- 
tation of an electron image. Measurements of resolution and charge 
gain have been described. The target has general application in devices 
requiring an electron-image-to-video transducer and in scan conver- 
sion devices. 
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Hologram Heterodyne Scanners 


By L. H. ENLOE, W. C. JAKES, JR., and C. B. RUBINSTEIN 
(Manuscript received April 1, 1968) 


Several techniques are proposed in this paper which use a scanning 
coherent light beam to produce an electrical signal which corresponds to a 
scanned hologram. The hologram itself is not formed at the transmitting end 
of the system as a physical entity, rather the modulated electrical carrier 
frequency corresponding to the spatial carrier frequency of the hologram is 
generated by heterodyning. An advantage of the hologram heterodyne scanner 
ts that it reduces the spatial resolution required of the camera tube in a 
holographic television system by at least a factor of four. 


I. INTRODUCTION 


In principle, it is possible to conceive of a holographic television 
system, but a practical system hinges on removing a number of 
formidable roadblocks. Among these is transducing the holographic 
information into a relatively narrowband electrical signal by means 
of a camera that has limited spatial resolution. This problem is shared 
by both a three-dimensional and a two-dimensional holographic sys- 
tem, but with present technology the 3-D system certainly presents 
many more problems. The two-dimensional system becomes much 
more tractable if the camera resolution problem is overcome. It is to 
this problem and our proposed solution that this paper addresses itself. 

Consideration of this problem is not new, as evidenced by the 
early outline of the bandwidth requirements of a holographic televi- 
sion system by E. N. Leith and others in 1965.1 There have also been 
reports of experiments involving the transmission via television of a 
Fresnel type of hologram in which the original object was a trans- 
parency.”* The difficulties encountered in these transmissions illus- 
trate the crux of the problem. 

A hologram consists of low spatial frequency information (dictated 
by the spatial information content of the object) which has been 
modulated onto a high spatial carrier frequency derived from the 
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interference of the reference and object beams. Conventionally, a 
television camera converts a spatial display, which in this case is a 
hologram, into an electrical signal by scanning it with an electron 
beam. However, the camera has a spatial frequency response that is 
low-pass in nature and limited in extent. Hence, the spatial carrier 
frequency rests out on the skirt of the passband, and the positive and 
negative sidebands are thus treated differently, producing distortion. 
We have so far lacked a feasible technique which would allow the 
effective use of the available bandwidth. That is to say, we need a 
technique which would allow the hologram information to be low- 
pass in nature until after being processed by the photosensitive sur- 
face of whatever transducer is used. The method we propose, which 
we call the heterodyne scanner, does exactly this. As a consequence, 
the required spatial resolution is reduced by at least a factor of four 
compared with the transmission of a conventional off-axis reference 
beam hologram. 

The technique we propose envisions a scanning coherent light beam 
to produce an electrical signal which corresponds to a scanned holo- 
gram. In contrast with conventional methods, the hologram itself is 
not formed at the transmitting end of the system as a physical entity; 
instead the modulated electrical carrier frequency corresponding to 
the spatial carrier frequency of the hologram is generated by hetero- 
dyning. 

These heterodyne scanners have important potential advantages 
over conventional methods of scanning. First, the nuisance terms 
corresponding to the direct beam in the reconstruction of a conven- 
tional hologram are not transduced. This halves the scanning beam 
aperture’s resolution requirements. Second, the necessity for resolving 
the spatial carrier frequency of the equivalent hologram with the 
scanning beam is circumvented. This halves again the scanning beam 
resolution requirements, thus reducing resolution requirements to a 
quarter of the original requirements. 

Let us briefly review the formation of a conventional hologram in 
order to point out the terms involved in the formation and reconstruc- 
tion. We will then be in a better position to discuss the desired re- 
duction and the means for accomplishing it. 


II. CONVENTIONAL HOLOGRAM 


Figure 1 depicts a typical holographic situation, showing an ob- 
ject beam 


LG) = Ae ere 
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Fig. 1— Formation of a conventional hologram. Object and reference beams 
impinge on hologram plane P. 


and a reference beam 


E,(c, 9) = A,iteentn 


impinging upon a hologram plane P. The intensity is given by 
I= A? + A,(a, y)’ 
+ 2A,A,(x, y) cos [(w, — w,)é + ¢.(z, y) + ax + ¢,]. (1) 


For the formation of a conventional hologram the two beams must be 
at the same frequency, that is*, w, = w,, so that equation 1 becomes 


I= A; + A(z, y)” + 24,4.(z, y) cos [¢.(z, y) + ax+¢,]. (2) 


During reconstruction, the first two terms of equation 2 form the direct 
beam and are unimportant except that they tend to obscure the recon- 
structed image which comes from the third term. Notice that the third 
term is a spatial carrier wave which is amplitude and phase modulated. 
If the spatial bandwidth of the wave to be reconstructed, A ,(z, y)e’*'*"”, 
is 2 W, then the bandwidth of the direct beam, A? + A,(z, y)”, and the 
bandwidth of the desired term, A,A,(x, y) cos [¢o(x, y) + ax + ¢,], will 
each be 4W, as shown in Figure 2. Thus if angular overlap between the 
direct beam and the desired reconstructed wavefront is to be avoided, we 
require that the spatial carrier frequency a of the desired third term in 
equation 2 be at least 3W. This results in a total spatial bandwidth of 8W. 


* There are exceptions. See Ref. 4. 
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Fig. 2—Spatial spectra of a conventional hologram signal. This representa- 
tion is only intended to convey the limits of the spatial spectra and not the 
explicit form of the function. 


III. ELIMINATING TERMS CORRESPONDING TO THE DIRECT BEAM 


Since the third term in equation 2 is all that is necessary to recon- 
struct a hologram, it is desirable to eliminate the generation of the 
direct beam. The spatial carrier frequency a could then be reduced 
from its present minimum value of 3W to W, reducing the minimum 
total spatial bandwidth which must be transduced from 8W to 4W. 
We are able to accomplish this rather easily with the heterodyne 
techniques discussed in this paper. 

If equation 2 represents the intensity formed on a television camera 
tube, the camera output current will be proportional to the intensity 
being scanned by the electron beam. For an z-direction scan, the 
amplitude and phase modulated spatial carrier frequency is con- 
verted to a correspondingly modulated electrical signal. Notice that 
precisely the same electrical signal can be obtained by using a photo- 
multiplier or other large area photodetector with a pinhole aperture 
placed just in front of the photosurface, the pinhole being scanned 
over the photosurface in a raster-like fashion in the same manner that 
the electron beam scans the photosurface in a camera tube. The out- 
put current from the photodetector would be proportional to the in- 
tensity of the light sampled by the pinhole. 

A more desirable transducer is obtained if: one shrinks the refer- 
ence beam into a small pencil beam of pinhole dimensions which is 
then scanned over the photosurface. The pinhole aperture is now 
superfluous and can be removed, thereby gaining a distinct advantage. 
The intensity of the light on the photosurface contributed by the 
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object beam acting alone and the reference beam acting alone is now 
constant as a function of time. Thus, the nuisance terms correspond- 
ing to the direct beam in the reconstructed hologram, that is, the 
first two terms of equation 2, are not transduced as time varying 
currents. The only time varying term is produced by the interaction 
of the pencil reference beam with the object beam as it scans out the 
raster. This time varying signal is proportional to the -desired third 
term in equation 2 which represents the useful information. Thus, 
the carrier frequency (dictated by the angle between the reference 
and object beams) can be reduced from 8W to W, which cuts in half 
the resolution requirements of the aperture of the scanning beam. 
(See Fig. 2). 


IV. ELIMINATING THE SPATIAL CARRIER FREQUENCY 


In the situation discussed in the previous section, spectrum over- 
lap in the electrical signal is prevented by adjusting the angle be- 
tween the object and reference beams to provide a sufficiently large 
spatial carrier frequency. As a consequence, the aperture of the 
scanning beam must resolve the highest spatial sideband frequency 
associated with this spatial carrier frequency. 

As an alternative to generating the electrical carrier frequency by 
scanning a corresponding spatial carrier frequency, it is possible and 
indeed advantageous to generate the electrical carrier by heterodyning 
the object beam with the reference beam. This can be done in such a 
manner that it is unnecessary for the scanning aperture to resolve an 
equivalent modulated spatial carrier frequency. It must resolve only 
those spatial frequencies present in the complex field of the object 
beam itself, that is, it must resolve only a spatial bandwidth of 2W. 
This realizes a subsequent reduction in the resolution requirements 
of the scanning beam by a factor of two over and above the reduction 
discussed in Section III. 

For convenience, let us rewrite equation 1, 


[= At + A,(a, y)” 


+ 2A,Ad(z, y) cos [@o oe w,)t + do(X, y) + ar + ¢,], (1) 


which represents the intensity of the light incident on the photodetector 
when the object beam and reference beams are at different frequencies 
wy and w,, respectively. Conceptually, these two frequencies can be 
sidebands produced by modulation, or can be two phase-locked modes of 
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the same laser, or produced by two phase-locked lasers, to name just a 
few methods. 

In the present situation, the scanning pencil-like reference beam 
has an amplitude which may be written as a delta function, A, = 
S8(x — ut)8(y — vt), where u and v are the horizontal and vertical 
scanning velocities, respectively. When this is substituted into equa- 
tion 1 and integrated over the surface of the photodetector to find the 
output current, we obtain for the time varying component: 


a(t) = 2A,(ut, vt) cos [Wo — w, + au)t + do(ul, vt) + ¢,]. 


We see that the frequency difference w) — w, has been added to the 
electrical carrier frequency au produced by scanning the spatial carrier 
frequency a. Thus, we may use a spatial carrier frequency of zero and 
avoid spectrum overlap (which produces beam overlap problems at 
reconstruction) by controlling the frequency difference between the 
reference and object beams. This means that the aperture of the scanning 
beam need resolve only those spatial frequencies present in the complex 
field of the object beam. For example, in Fig. 2 it need resolve only the 
spatial spectrum of A(z, y)é**”. Thus, the total spatial bandwidth 
required has been reduced by a factor of four. 


V. ADDITIONAL CONSIDERATIONS 


It might be appropriate to speculate on other aspects of the holo- 
gram heterodyne scanner. One might consider other means of im- 
plementing this technique. For example, one could replace the large 
area photodetector with a small area or point detector such as a 
photodiode. Rather than deflecting the pencil-like reference beam 
relative to a fixed object beam, the object beam is deflected relative 
to a fixed pencil reference beam. This allows one to reduce the size 
of the photodetector from that equal to the size of the equivalent 
hologram to that equal to a point. The price paid for this simplifica- 
tion is the difficulty associated with deflecting the information-bear- 
ing object beam without introducing excessive distortion. This will, 
of course, be more difficult than deflecting a pencil-like reference 
beam. 

Another aspect of the use of the hologram heterodyne scanner is its 
potential sensitivity. Heterodyning is a well known technique for 
converting a weak light signal into an electrical current whose signal- 
to-noise ratio is determined by the fundamental quantum nature of 
light.® The heterodyne scanner should have potentially the best. sensi- 
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tivity obtainable by any technique which does not use charge storage. 

Without reported experimental verification of the hologram hetero- 
dyne scanner, it would not be especially fruitful to spend much time 
at this point speculating on possible display devices. There are a 
number of possible approaches to this problem but they are admittedly 
speculative. These include the Swiss Eidophor System® as a direct 
recording medium for the holographic electrical signal, or the use of 
photochromic materials for recording the output of a laser beam 
modulated by the holographic electrical signal and raster scanned 
over the photochromic surface. Both these methods have been referred 
to by E. N. Leith and his colleagues. 


VI. COMMENTS AND COMPARISONS 


Let us compare the hologram heterodyne scanner to the method 
proposed by C. B. Burckhardt and E. T. Doherty.? An extension of 
their technique* reduces the spatial resolution required of the camera 
tube by the same factor of four as the hologram heterodyne scanner, 
but even in its improved version it still requires a 50 percent greater 
electrical information rate than the scanner. However, the technique 
can be implemented with present technology. 

A technique has been reported by L. H. Lin which involves spatial 
frequency sampling to reduce the information content of the holo- 
gram.® This is accomplished by an iterative Fourier transform tech- 
nique. It is possible to augment Lin’s technique with the hologram 
heterodyne scanner to increase the bandwidth saving. The transla- 
tion of the hologram peculiar to Lin’s technique can be carried out at 
the receiver. K. Haines and D. B. Brumm have also reported on a 
technique which can be used to reduce the information to be trans- 
duced and is compatible with the hologram heterodyne scanner.® 

The hologram heterodyne scanner is a general technique that would 
apply to three-dimensional objects or two-dimensional transparencies. 
However, this does not mean that the implementation of this technique 
presents the same difficulties in each case. The spot size of the scan- 
ning reference beam and the speed of scan are directly influenced by 
the type of object to be transduced. The requirements are more 
stringent for the three-dimensional object if reasonable parallax is 
to be observed. Advances are currently being made in laser scanner 
technology which will help alleviate one aspect of the problem. A. B. 


*The technique presented in Ref. 7 has been extended by C. B. Burckhardt 
to apply to TV transmission but this specific aspect is unpublished. 
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Larsen of Bell Telephone Laboratories is conducting experiments 
implementing the hologram heterodyne scanner technique. Many of 
the concepts of this paper have been improved and extended by him 
and will be reported in the near future. 
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Analysis of Thermal and Shot Noise 
in Pumped Resistive Diodes 


By CORRADO DRAGONE 
(Manuscript received April 8, 1968) 


This paper discusses certain important aspects of the notse behavior 
of a pumped resistive diode containing shot and thermal noise sources. 
The derivation of the following result has a central role in the discussion. 
It is shown that the noise behavior of a pumped diode which does not contain 
1/f noise sources can be derived in a very simple way from Nyquist’s 
theorem. This follows from the fact that the small-signal terminal behavior 
of such a diode can always be represented, in the frequency range of practical 
interest, by means of a connection of two linear and time-invariant net- 
works of which one is noiseless and the other is dissipative, contains only 
thermal noise sources and ts held at a uniform temperature. 


I. INTRODUCTION 


The process of frequency conversion and its applications are well 
known and are extensively treated in the literature.-*? This paper con- 
siders the special case of a resistive diode frequency converter. An im- 
portant limitation on the minimum noise figure of such a frequency 
converter is imposed by the noise generated by the diode, and it is the 
main purpose of this paper to study the properties of this noise. 

Until a few years ago, much of the noise generated by the diode was 
1/f noise. Therefore, since very little was known about this type of 
noise, the early theories of frequency converters using positive resis- 
tance diodes paid little attention to the noise performance, and some- 
what later theories accounted for noise only in a very approximate 
way. However, as the semiconductor craft has developed, 1/f noise 
has been subject to considerable reduction and, even though its exact 
mechanism has not yet been completely established, in present diodes 
it appears to be important only at very low frequencies.?* Therefore, 
the study of shot and thermal noise in pumped diodes is of great prac- 
tical importance. 
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Strutt showed the method of treating shot and thermal noise in a 
pumped diode many years ago.® Since that time the method has been 
applied to tunnel diode frequency converters by a number of au- 
thors'*-°° However, in the case of a frequency converter using a posi- 
tive resistance diode, it is normally believed that, in order to calculate 
its noise figure, a detailed analysis of the noise behavior of the diode 
is not necessary.?® 12-14 

Consider a positive resistance diode which does not contain fre- 
quency dependent noise sources. That is, assume that, for any fixed 
voltage v applied to its terminals, the small-signal terminal behavior 
of the diode is equivalent to that of an ordinary resistor held at a uni- 
form temperature 7. The conductance g of this resistor is equal to the 
differential conductance of the diode and T is the so-called equivalent 
noise temperature of the diode. Since in a frequency converter the 
diode is pumped periodically by the pump, v varies with time. There- 
fore, g and T also vary with time, because they both depend upon v, 
and one can write g = g(t) and T = T(t). 

Normally it is convenient to represent the small-signal terminal be- 
havior of the diode by means of a linear and time-invariant network 
with several separate terminal pairs, one for each frequency of inter- 
est. A study of the noise behavior of this equivalent network generally 
requires that the self- and cross-power spectral densities of its short- 
circuit terminal currents, or of its open-circuit terminal voltages, be 
determined. Normally, however, the difficulty in determining the statis- 
tics of these noise terms is overcome by making the assumption that 
the equivalent network may be treated as an ordinary time-invariant 
dissipative system which contains only thermal noise sources and is 
held at a uniform temperature 7,. T’, is normally assumed to be equal 
to a certain time average of 7'(t). 

Even though no general proof has yet been given for this representa- 
tion, it is widely used, mainly because it greatly simplifies the treat- 
ment of the noise performance of a frequency converter. However, it 
is often viewed with reservations for several reasons.*5 One very im- 
portant reason is that it is generally applied to cases, in which one can 
easily show that it is not applicable, such as cases in which significant 
1/f noise is generated by the diode. Another reason is that its validity 
is not obvious even in the limiting case where the noise power available 
from the diode is frequency independent and does not vary with the 
applied voltage. In fact, even in this limiting case, it is often considered 
to be not strictly valid, 
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However, in this paper it is shown that, besides being valid under 
certain limiting conditions, such a representation can also be used for 
formulating and interpreting in a very simple way the noise behavior 
of a pumped diode under quite general conditions, including a negative 
resistance diode. 

In the following discussion it is assumed that the diode does not con- 
tain frequency-dependent noise sources, so that its small-signal term- 
inal behavior may be completely specified by the two time-varying 
parameters g(t) and 7'(t). Then it is shown that, in the limiting case 
where T' is a constant, the following theorem is true: 


Theorem 1: If a pumped resistive diode is characterized by a time- 
invariant equivalent noise temperature T, then its small-signal termi- 
nal behavior can be represented by means of a time-invariant equi- 
valent network which contains only thermal noise sources and 1s held 
at a uniform temperature T, = T. 


From this general theorem, which is already known to be valid un- 
der certain particular circuit conditions,?? a number of interesting re- 
sults can be derived. One important result is of course that, in a fre- 
quency converter which is bilateral and in which the noise temperature 
of the diode has negligible variations with time, the noise figure can be 
readily calculated. In fact, under these limiting conditions the noise 
figure can be related in a very simple way to T and to the dissipation 
characteristics of the circuit.?® 12-14 

Another important result is that, also in the general case where T'(t) 
is not a constant, the terminal behavior of the diode can be readily 
derived from theorem 1. This is a consequence of the following gen- 
eral property, which follows directly from the definition of T(t) and 
is stated as a theorem for emphasis: 


Theorem 2: Consider a pumped diode characterized by the time-vary- 
ing parameters T(t) and g(t). Its short-circwt noise current 8n(t) ts 
identical to that of a second diode characterized by a time-invariant 
temperature T2 and a differential conductance | g(t) | T(t)/To. 


According to theorem 1, this second diode can be represented by an 
equivalent network held at a uniform temperature T'2. Therefore, by 
applying to this equivalent network the generalized form of Nyquist’s 
theorem derived by Twiss,”* the correlations between the various fre- 
quency components of 5n(¢) can be readily determined. One finds that 
these correlations are simply equal to the Fourier coefficients of g(t). 
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This property is already known to be valid under certain particular 
circuit conditions.1*-?° 


Now, consider a linear, reciprocal, passive and time-invariant one- 
terminal pair network containing different elements held at different 
temperatures. It is well known’ that at a given frequency «, the effec- 
tive noise temperature of this network can be expressed as a weighted 
average of the various temperatures of the lossy elements. The weighting 
factors in this weighted average are simply equal to the amounts of 
power that are dissipated by the various lossy elements when the net- 
work is connected, at its two terminals, to a generator delivering a 
unit amount of power at the considered frequency w, . This result is 
extended, in Section VIII, to a reciprocal and linear network containing 
a time-varying resistance, by introducing the concept of average tem- 
perature T,, of a pumped resistive diode. The significance of this param- 
eter is best illustrated by the following example. 

Suppose that one wants to calculate the noise power available from 
the output terminals of a frequency down-converter. It is shown that, 
if the frequency converter is bilateral, this power can be calculated 
by replacing the diode with one having the same 1-v characteristic and 
a temperature equal to 7',,, where T,, is given by the relation 


_ (POT) 

sae 0) @ 
where (_ )ay indicates the time average and P(t) is the instantaneous 
small-signal power dissipated by the differential conductance of the 
diode when a small-signal generator is applied to the output terminals 
of the frequency converter. It is important to point out that 7’, de- 
pends, in general, both on the characteristics of the diode and on those 
of the circuit connected to it. 


II. SMALL SIGNAL EQUATIONS OF A NOISELESS PUMPED DIODE 


Let the diode current 7 be a nonlinear function f(v) of the terminal 
voltage v. It is assumed that the diode is pumped by a strong periodic 
source at a frequency w, and its harmonics. Therefore v and 7 contain 
large components v,(¢) and 7,(¢) of the type: 


v(t) = >> V;, exp jkwt (2) 
k=—0oo 


eo 


i.(t) = >> I, exp jkw,t. (3) 
k= 


—-o 
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It is assumed that v and 7 contain, in addition, small components $v (¢) 
and 62(¢) and it is desired to derive the relation between 8v(t) and 
62(¢), for the limiting case 8u(t) > 0 and 8&(t) — 0. Thus let 


v = of) = v(t) + dv(2), (4) 
t= a(t) = 1.(2) + &(Z). (5) 


The differential conductance of the diode is equal to the derivative of 
f(v). Let it be denoted by ga(v) and let 


gt) = galv.(é)]. (6) 
Since v,(¢) is periodic, also g(t) is periodic and therefore it can be 
written in the form 


g(t) = > gz exp jkwot. (7) 


Since 7 = f(v) and ga(v) is the derivative of f(v), from equations 4, 5, 
and 6 one has: 


di(t) = g(t) dv(¢) (8) 
in the limiting case 8v(t) — 0. This relation completely describes the 
small-signal terminal behavior of the diode, in the absence of internal 
noise sources. 


III. SMALL SIGNAL EQUATIONS OF A NOISELESS DIODE IN THE 
FREQUENCY DOMAIN 


From equations 7 and 8 the relations between the different frequency 
components of 6v(é) and é7(¢) can be readily derived*’’. In fact, assume 
that both év(t) and éz(t) contain components at only the pairs of side- 
frequencies kw, + p and kw, — p (| k| = 0, 1, 2, ete.; 2p < w,). Then 
dv(t) and 62(t) can be expressed as follows: 


dv(t) = are > Vax exp j(p + kw,)t + pz Ver exp j(p — ku )e | (9) 


di(t) = are 3 Tax exp j(p + ke,)t + 3 Is, exp j(p — kat | (10) 


and, on substituting equations 9, 10, and 7 into equation 8, one obtains 
the following relations between the Fourier coefficients of the various 
frequency components of dv(té) and 8(¢): 


Toe = > Grae + a Ode ge (Y = 0,1, ete.) (11) 
k=0 =1 
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Toe = Zi gerVoe+ Di g-rwVer (= 1,2, ete) (12) 
=1 k=O 


which can be written in the form: 


ey = pee ee fe (13) 
Ts] [Gea] [Ges JiL Ve] 


where the matrix notation is defined as follows: 


Lao Ta Vie Var 

D2 Igo Vai Vo0 
Ll=|7 ) tl=|p.) Vl=lyf Yl=ly,| O49 

and the elements of the matrices [Gaa], [Gof], etc., are 

(Gaalese = Gr-r (7,k =0,1, ete.) (15) 

(Gop)r.n = Gu-r (7,k =1,2, etc.) (16) 

(Gap)r.k = Grtk (,k —1=0,1, etc.) (17) 

(Gealrk = J-r-k (r —1,k=0,1, etc.). (18) 


Equations 13 through 18 completely specify the terminal behavior 
of the diode at the frequencies p + kw in the absence of internal noise 
sources. 


IV. SMALL SIGNAL TERMINAL BEHAVIOR OF A NOISY DIODE 


Up to this point the noise generated by the diode has been ignored. 
In the general case of a noisy diode equation 8 has to be modified as 
follows: 


514) = gt) svt) + dni) (19) 
where 8n(t) is the equivalent short-circuit noise current of the diode. 
Equation 19 corresponds to the equivalent circuit shown in Fig. 1 in 
which the spontaneous fluctuations of the diode are ascribed to a cur- 
rent generator of infinite internal impedence, acting in parallel to the 
differential conductance of the diode. 

Now, consider the components of én(t) occurring in an infinitesimal 
frequency range between w — (dw)/2 and w + (dw)/2. It is convenient 
to account for these components by means of a single pseudosinusoid 
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nit} t vit) 
git) 


Fig. 1— Equivalent circuit of a time-varying conductance containing noise 
sources. 


with random complex amplitude”~**. Then, let Na, and Nz, (k = 
0, 1, 2, etc.; r = 1, 2, etc.) be the complex Fourier amplitudes of the 
pseudosinusoids relative to the frequencies p + kw, and p — kw,, 
respectively. Noise components occurring at frequencies different from 
these will be neglected since they have no effect on the small-signal 
terminal behavior of the diode at the frequencies p + kw, . Then 


én(t) = 2Ro\ > Naz exp j(p + kw,)t + > No exp j(p — kwpt} (20) 


and from equations 13 and 19 one obtains: 
be a re iu i * *- (21) 
Ig] Gea] [Geplt LV] Ne] 


Nao Nar 


and Ng] = |], (22) 


where 


Equation 21 completely specifies the small-signal terminal behavior 
of a pumped diode containing noise sources. Its physical interpretation 
is often facilitated by introducing the equivalent circuit of Fig. 2. In 
this equivalent circuit the diode is represented by a linear and time- 
invariant network in which the terminal voltages and currents occur 
at the same frequency. Their Fourier coefficients are equal to those of 
the various frequency components of v(t) and 8&(t). 

The network of Fig. 2 is completely specified with respect to its ter- 
minal pairs by its admittance matrix 


IG] = rae a (23) 
[Geos [Gee] 
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lao 
Vao { Nao 
Var] Tat 


Ta —p 
Vai} Net 


v| 


1]—> I pt 


+E 


[¢] 








ee@e 
Tae 
Oo 


Fig. 2— Time-invariant equivalent network of a pumped diode. 


and by the noise column matrix 


Nil 
mLya on 


which represents the complex Fourier amplitudes of its short-circuit 
terminal currents. The self- and cross-power spectral densities of these 
noise currents are 


and 


(Nearer)  (NoN5,) 
G B : (25) 


df ; 
where ( ) indicates the statistical average. They are conveniently rep- 
resented by the matrix 


(N aN) 
df 
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1 t 
dj (NIN]), (26) 


where superscript’ denotes the Hermitian conjugate. 

Now let the properties of the equivalent network of Fig. 2 be briefly 
examined. Since g, = g*, (k = 0, 1, 2, etc.), from equations 15 through 
18 one has that the admittance matrix [G] is Hermitian, that is: 


[G] = [Gy]. (27) 


It is interesting to note that this condition is equivalent to the condition 
that 


(IM) (VI'IG]V]) = 0 (28) 


for all V], which requires that the total reactive power flowing into the 
nonlinear resistance at the various side frequencies p + kw (k = 
0, 1, ete.) be always zero. This property is a direct consequence of the 
general energy relations derived by Manley and Rowe for nonlinear 
resistors.”” Because of equation 27 the average small-signal power dis- 
sipated in the admittance g(t) can be expressed as 


(50()"9) av = VIG] + [G]*)V] = 2V}'[G]V]. (29) 
Therefore, if 


g(t) > 0 (80) 
at all times, then [G] is both Hermitian and positive definite and the 
equivalent network is dissipative. 

Now, consider a linear and dissipative network which contains only 
thermal noise sources and is characterized by an admittance matrix 
equal to [G]. If such a network is held at a uniform temperature T, 
then the various spectral densities of its short-circuit terminal currents 
are simply given by the elements of the matrix 


kT((G] + [G]"). 
From this generalized form of Nyquist’s Theorem, proved by Twiss,”* 
and from equation 27 one has that, if condition 30 is satisfied and the 
matrix 26 satisfies the relation 
(NIN]}') = 2kTaf(Gl, (31) 


then the small-signal terminal behavior of the diode can be represented 
by means of an equivalent network which contains only thermal noise 
sources and is held at a uniform temperature 7’. 
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Of special importance is the particular case in which the circuit con- 
nected to the diode is resistive at the harmonics 2w,, 3w, , etc., of the 
pump frequency and, at these frequencies, does not contain generators. 
Under these conditions it is always possible to choose the origin of time 
in such a way as to make »v,(2), 7.(¢) and g(t) even functions of time.’ 
In this case, since all of the coefficients g, (k = 0, 1, etc.) become real, 


Ju = J-z» (32) 


and therefore [G] becomes a real symmetric matrix, because of 
equations 15 through 18. If, in addition to equation 32, condition 
30 is satisfied, then the equivalent network of Fig. 2 can be realized 
by means of an ordinary resistive network.’ 

Of course, in the general case where the origin of time cannot 
be chosen to make all the coefficients g; real, the diode cannot be 
represented by a reciprocal (bilateral) network. 

Condition 31 is never satisfied if g(t) becomes negative for some 
values of t. In fact in this case [G] is indefinite, while (V]N]*) is 
always a positive definite or semidefinite matrix. On the other hand, if 


g(t) <0 (33) 


for all values of t, then [G'] is negative definite and of special interest 
becomes the condition 


(NIN}') = —2kT dj{Gl. (34) 


In fact, consider a pumped negative resistance diode which satisfies 
this condition. If a frequency converter is made from such a diode 
by imbedding it in a lossless network, then its noise measure, defined 
by Hauss and Adler,°° is independent of the characteristics of the 
lossless network and is simply equal to 7'/T.,, where J, is standard 
temperature, 290°K. Therefore, if G, is the exchangeable gain of such 
a frequency converter, its noise figure / is simply equal to 


F=1+T7/T, 0 — 1/G,). (35) 


V. SHOT NOISE IN A PUMPED DIODE 


Assume that the diode only contains shot noise sources and that 
in the frequency range of interest transit time effects can be neg- 
lected.® 11, 26, 31, 32 

Assume for the moment that the voltage v applied to the diode 
is time-invariant. Then 6n(t) can be treated as white noise over the 
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frequency range of practical interest. Therefore, if S denotes its spec- 
tral density, it can be expressed as 


én(t) = S'x(t), (36) 


where x(t) is white noise with unit spectral density. S will in general 
depend upon the voltage v applied to the diode and it is convenient 
to express this voltage dependence in the following form: 


S = Sa(v) = 2kT (rv) | ga(v) | (37) 


where kTa(v)/2 represents the exchangeable noise power at the 
diode terminals, per unit bandwidth. In equation 37 the occurrence 
of the factor 2, in place of the usual factor 4, results from the fact 
that here both positive and negative frequencies are considered. 

Now, consider the general case where v is not a constant and let 
the concise notation 


Tt) = T.lv.(2)] 
SQ = Sav.()] = 2kTH) | 9 | 


(38) 


be introduced. Then $n(¢) results from the superposition of statis- 
tically independent random disturbances whose probability of occur- 
rence is proportional to the deterministic and periodic function 


h(t) = [S@]? (39) 


Since it is assumed that the duration of these disturbances is much 
smaller than the reciprocal of the highest significant frequency 
of h(t), equation 36 is still applicable and therefore . 


én(t) = h)x(d). (40) 


Now let consideration be restricted to the fluctuation components 
occurring in infinitesimal frequency intervals of width df, centered at 
the frequencies p + kw,. Then, since x(t) is white noise with unit 
spectral density, from equation 40 one has that 6n(é) can be expressed 
as follows:*’”° 


an(t) = h(i) D> 2(@H)* cos [ser + pl) + vl (41) 


where y, are statistically independent random phase angles distributed 
uniformly over the range (0, 27). 


1894 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 


Let h(t) and S(t) be represented by the Fourier series 


S(t) = >> S, exp jkw,é (42) 
k=e—o 

h(t) = >> A, exp jkw,t. (43) 
k=—0 


From equation 39 one has that S, and H;, are related through the 
relations: 


S, = dy A,H,-: ‘ (44) 
k=~00 


Introduction of equation 43 in equation 41 gives: 
in(d) = (a YD H,fesp ills + est + pt + ol 
+ exp j[¢ — s)w.t — pt — ¢.]} 
= (df)? > pa Hy-s EXP je. exp jlkw. + pt 


s=a—coo k= 


+ H,-1 exp —jy, exp —j(kw, + p)t (45) 


= 2¢ree) (a! p> > H,-, exp jy, exp jlkw, + 7) i, 


From this last relation and from equation 20 one obtains 


oO 


Nax = (df)? 2 Hy, EXP Jes (46) 
Nea = (af) ye H_y-. XP je, - (47) 
Hence, since 
: : 1, r=s§ 
(exp J. exp —Jy,) = ; 
0, rx*s 


from equations 46 and 47 one obtains: 


N arN ar = 
Tastee i es: fz (aan; Sn 


= Diet 


(48) 
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and 


Wai Bh) AH 


df — —, rts (49) 


Now, consider a time-varying conductance equal to S(t) and let 
[S] denote its admittance matrix. Then [S] is obtained from equations 
15 through 18, and 23 by formally replacing g, and G with S, and S 
throughout. Therefore the elements of [S] are equal to the various 
Fourier coefficients S, and, from equations 48 and 49, one obtains 
the final result 


(NIN}') = af{S}. (50) 
VI. NOISE BEHAVIOR OF THE DIODE 


The preceding section showed that if a diode contains only shot 
noise sources, then the self- and cross-power spectral densities of 
its short-circuit terminal currents are simply equal to the Fourier 
coefficients of 2k7'(t) |g(t)|, over the frequency range of practical 
interest. Let us examine the significance of these relations, which 
are valid even if the diode contains thermal noise sources. 

First, consider the special case of a positive resistance diode char- 
acterized by an equivalent noise temperature Ta(v) which is ap- 
proximately constant over the range of voltages of interest, so that 
the approximation 


T.(v) = T = constant (51) 
can be made. In this case since equations 38 give 
Sz. = 2kT 9, , (52) 
one has 
[S] = 2kT[G] (53) 


and therefore from equation 50 it follows that the spectral density 
matrix satisfies condition 31. One concludes that, if 7'4(v) is inde- 
pendent of v and condition 380 is satisfied, then the small-signal 
terminal behavior of the diode can be represented by a time-invariant 
dissipative network held at a uniform temperature T’,, as stated in 
theorem 1. 

Thus, in the limiting case (51) and under the restriction g(t) > 0, 
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equation 50 can be interpreted as a direct consequence of theorem 
1 and Nyquist’s theorem. Now, a little reflection shows why equation 
50 also is valid in the general case where 7'(¢) is not a constant 
and the restriction g(t) > 0 is removed. In fact, two diodes having 
the same S(t) have the same short-circuit terminal currents, no 
matter what their differential admittances may be. Notice that from 
this rather obvious property theorem 2 follows at once. That is, the 
short-circuit terminal currents of a diode characterized by an equi- 
valent noise temperature 7'(t) and by a differential conductance 
g(t) are identical to those of a diode characterized by a constant 
temperature 7’. and a differential conductance 


lg) ((TOV/T., (54) 


where 7, is an arbitrary temperature. It is important to point out 
that, even though the foregoing two diodes have the same short-circuit 
terminal currents, they are not equivalent since they have different 
conductances. On the other hand, the terminal behavior of a diode 
characterized by a voltage-dependent temperature Tz(v) and a con- 
ductance gg(v) is equal to that of the parallel connection of the two 
diodes (see Fig. 3) with voltage-independent temperatures T and 
T. and with the differential conductances gai(v) and ggo(v) defined 
by the following equations: 


gar) + gas?) = galr) (55) 
gaW)Ts + gacd)T's = gav)T2@) (56) 

where 7'; and T» are subject to the only condition 
T, < Ta) < T, (57) 


which guarantees that gai(v), ga2(v) and ga(v) have all the same 
sign. Notice that the equivalent circuit of Fig. 3 and the original 
diode have the same short-circuit terminal currents because of equa- 


I 


Tg), ggtV) | T149dy (Vv) T21 Ida (Vv) 


Fig. 3— Representation of an arbitrary noisy resistive diode by means of two 
diodes with voltage-independent noise temperatures 7, and 7%. 
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tion 56, and have the same differential conductances because of 
equation 55. An important feature of this equivalent circuit is that 
theorem 1 is applicable to both diodes and it can therefore be studied 
by standard techniques. Particularly interesting is the limiting case 
T, = 0. In fact in this case one of the two diodes becomes noiseless 
and the other has the time-varying conductance 


go(t) = gale.) = gQ(TOMT:; (58) 
when the pump voltage v,(t) is applied to it. Hence, by comparing 
equation 54 with equation 58 one obtains the following result. If 
g(t) > 0 and T, > T(#) for all values of ¢, then, by connecting a 
noiseless diode having the conductance 


nt) = gale] = g@[T2 — THVT. (59) 


in parallel with the second diode of theorem 2, one obtains a circuit 
completely equivalent to the original diode. 

Now, consider the case where g(t) < O for all values of ¢ and 
suppose that condition 51 is satisfied. Then 


[S] = —2kT7[G] (60) 


and from equation 50 one has that condition 34 is satisfied. Hence, 
the remarks about this possibility at the end of Section III apply. 
In general, where T(t) is not a constant, equation 35 is not valid. 
However, if 7; and JT. are the minimum and maximum values of 
T(t), so that 


NsTOs lf, (61) 


then one can say that the noise performance of the diode will be 
bounded by the two limiting values obtained from equation 35 for 
the two limiting cases T = T; and T = To. 


VII. TERMINAL BEHAVIOR OF THE DIODE IN THE IMPEDANCE-MATRIX 
REPRESENTATION 


In some cases it is convenient to use the impedance-matrix rep- 
resentation, rather than the admittance-matrix representation, for 
describing the terminal behavior of the diode. Let 


r(é) = 1/g() = = r, exp jkw,t (62) 


be the differential resistance of the diode, Then the impedance-matrix 
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representation of the small-signal terminal behavior of the diode can 
be written in the form 


V] = [RJ] + N,] (63) 
where the relations between the elements of the impedance-matrix 
[R] and the coefficients 7;, are identical to those between the elements 
of [G] and g;, (see equations 15 through 18). The column matrix 


N,] consists of the amplitudes of the open-circuit terminal voltages 
of the diode. If 


o 


O(2) = 2kT() |r(t) | = >> 0, exp jkw,t (64) 


“oO 


then one has 


(N,IN,]*) = df{0] (65) 
where [0] can be obtained from equations 15 through 18, and 238 
by replacing G and g, with 0 and 0; throughout. Notice that equa- 
tion 65, which is analogous to equation 50, follows from theorem 1, 
Nyquist’s theorem** and the fact that two diodes having the same 


O(t) have the same N,], no matter what their differential resistances 
may be. 


VIII, AVERAGE TEMPERATURE OF A PUMPED DIODE 


Consider a linear, reciprocal, passive and time-invariant one-terminal 
pair network containing different elements held at different tempera- 
tures. It is well known” that at a given frequency w, the effective noise 
temperature of this network can be expressed as a weighted average of 
the various temperatures of the lossy elements. The weighting factors 
in this weighted average are simply equal to the amounts of power that 
are dissipated by the various lossy elements when the network is con- 
nected, at its two terminals, to a generator delivering a unit amount of 
power at the considered frequency w, . This result is extended, in this 
section, to a pumped diode. 

The concept of average noise temperature T',, of a pumped diode 
is introduced in this section. Consideration is restricted to the case 
where g(t) = O and condition 32 is satisfied, so that the equivalent 
circuit of the diode is passive and bilateral. It is shown that 7’, de- 
pends, in general, both on the characteristics of the diode and on those 
of the linear and time-invariant circuit connected to it. However, if 
certain conditions are satisfied, then it only depends on the diode 
characteristics. 
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Consider a one-terminal-pair network N consisting of a pumped 
diode imbedded in a linear, time-invariant and bilateral two-terminal- 
pairs network WN’ (Fig. 4). Assume, furthermore, that the network 
N can exchange power at a single frequency «, at its terminals. Then 
the average temperature 7',, of the diode is defined in the following 
way: 

Tay is such that the noise power available from the terminals of 
the network N does not change if the actual temperature-voltage 
characteristic T'a(v) of the diode is replaced with a constant tem- 
perature equal to T,,. 

Now let a small-signal generator of frequency ; be connected to 
the terminals of the network N, and let 8:(¢) and 8v(t) be the small 
signals produced at the diode terminals. It will be shown that 


: ee" ) sold) T(t) at 
a (66) 


| 5 sol) dl 


Notice that this equation is equivalent to equation 1. 
Proof: Itis convenient to represent the circuit of Fig. 4 by means of the 
equivalent circuit of Fig. 5, where the network D represents the small- 
signal terminal behavior of the diode and each terminal pair of D 
exchanges power at only one frequency. Notice that in Fig. 5 the net- 
work N’ has been represented by means of several separate equivalent 
circuits, N{, N{, etc., one for each frequency of interest. 

The network D can be decomposed into two separate networks 
Dy, and De, each held at a uniform temperature. 

In fact, let the diode of Fig. 4 be replaced by the two diodes shown 
in Fig. 3 and let 82, (¢) and 82(¢) be the small-signal currents of the 











Fig. 4— Diode imbedded in a linear, time-invariant and bilateral network N’. 
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Fig. 5 — Equivalent circuit of the network N of Fig. 4. 
two diodes. Then, from equation 56 one has: 


du(t) 62 Ti) = Sv(t) 64,1) 7, + dv) 622.() T. 
which gives: 


(Bu(0) Bi) TD)ow _ (B0(0) Fiat (B0(0) Bis()YaPe, (gy 
(dv(4) 82(2))av (dv(t) 8¢(2) aw 
Since theorem 1 is applicable to both diodes of Fig. 3, it is clear 
that the network D of Fig. 5 can be represented by the parallel 
connection of two networks (D, and Dez) of which one is held at a 
uniform temperature 7, and dissipates an average power equal to 
(Sv (t) 821 (t))ay, and the other is at JT’, and dissipates (8v (t) 8te(t) Jay. 
Now, since both D; and Dz are bilateral, the noise power available 
from the network N of Fig. 5 does not vary*® if the temperatures 
of D, and Dz are changed so that they become equal to 


(u(t) 8014) avs + (60(L) Stn(2) av T's 
(dv(2) dti(1) av + (dv(t) 5t2(t))av 


which, together with eq. 67 gives eq. 66. 

Equation 66 is of particular interest when w, = 7, since in this case 
it corresponds to the example considered in the introduction. Notice 
that 7,, is not, in general, a function of the diode characteristics alone. 
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In fact, it also depends both on the particular characteristics of the 
network N’ in which the diode is imbedded and on the value of the 
frequency w, at which 7’, is defined, unless 7'(t) is constant. 

Under conditions of practical interest 7 always varies with time and 
consequently a direct application of theorem 1 is never strictly valid. 
Equation 66 shows, however, that if certain conditions are satisfied, 
then T,, is little affected by the particular choice of w, and N’, and 
consequently a direct application of theorem 1 may not introduce 
significant errors. More precisely, suppose that either 





1 
~e ~ 0 68 
ga(v) 0 or ga(v) ( ) 
for some values of v and that 
T,(v) = T’ = constant (69) 


over the range of voltages for which conditions.68 are not satisfied. 
Under these conditions either T'(é) = T’ or dv(t) 62(t) = 0, for all values 
of t, and consequently from equation 66 one obtains T,, ~ TJ’. There- 
fore in this case, and only in this case, 7’, can be regarded as a function 
of the diode characteristics alone and theorem 1 is applicable, with T 
replaced by 7”. 

An important application of the preceding result is given by an 
ideal Schottky barrier diode. In fact, the relations derived in Ref. 
22 between the noise figure and the conversion loss of such a diode 
imply that its junction can be represented, under certain particular 
conditions, by means of an ordinary resistive network held at half 
the temperature 7’, of the junction. 
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The Analysis of Circular Waveguide 
Phased Arrays* 


By N. AMITAY and VICTOR GALINDO 
(Manuscript received July 9, 1968) 


In this work, a planar phased array of circular waveguides arranged in 
an equilateral triangular grid ts analyzed. The boundary value problem ts 
first formulated rather generally in terms of a vector two dimensional integral 
equation for an array of elements that are arranged in a doubly periodic grid 
along two skewed (nonorthogonal) coordinates. Dielectric plugs, covers, and 
loading, as well as thin irises at the aperture, are accounted for in the 
formulation. Numerical solutions are obtained by using the Ritz-Galerkin 
method to solve the integral equation. Excellent agreement with experimental 
measurements using a waveguide simulator ts observed. The existence of 
forced surface wave phenomena in equilateral triangular grid arrays and 
their strong dependence upon the mode of excitation is also demonstrated. 
These phenomena are shown to exist at tsolated points in the scan co- 
ordinates. Reflection characteristics as well as the polarization characteristics 
of the radiation pattern are illustrated at selected planes of scan for both 
linear and circular polarization excitation. 


I. INTRODUCTION 


The requirements of modern radar and communication systems have 
stimulated considerable activity in the design and use of phased 
array antennas. To date, the design information required for their 
development has been obtained from the analysis of simplified array 
models and from experimental data. The great speed and storage 
capacity of present day digital computers, however, have now made 
it possible to solve the planar phased array boundary value problem 
very accurately.? ? 

A general formulation of the planar phased array boundary value 
problem may be found. A vector two dimensional integral equation 


* The work reported in this paper was supported by the U. S. Army Materiel 
Command under contract DA-30-069-AMC-333(Y). 
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for the tangential aperture field (that is, the tangential field at the 
planar interface between the waveguides and free space) can then 
be derived. 

In its most general form the array elements are assumed to be 
arranged in a doubly periodic grid along two skewed (nonorthogonal) 
coordinates; and dielectric loading, covers, and plugs, as well as thin 
irises at the aperture plane, may be accounted for in the analysis. 
The possibility of multimode excitation of the array has also been 
included. The Ritz-Galerkin method is applied to obtain a solution 
for circular waveguide arrays. 

Numerical solutions for the reflection characteristics of dielectric- 
free planar arrays of circular waveguides hexagonally arranged in a 
conducting ground plane have been carried out. Experimental meas- 
urements have been made which compare favorably with the results. 
Forced surface waves? are found to occur at isolated points in the 
scan coordinates and can be related to certain vector and geometrical 
symmetries for an equilateral triangle grid array. These surface waves 
(or resonances) are often difficult to locate experimentally by the use 
of waveguide simulators*?* or small test arrays. The strong depend- 
ence of these forced resonances upon the mode of excitation is also 
demonstrated. The reflection characteristics as well as the polarization 
characteristics of the radiation pattern are illustrated for various 
combinations of linear and circular polarization excitation of the 
array. 


II. ANALYSIS 


An infinite planar array of waveguide elements, Fig. 1, is imbedded 
in a conducting ground plane at its interface (plane z = 0) with free 
space. The elements are arranged in a periodic grid along the skewed 
(nonorthogonal) coordinates s; and sz. The x and s; axes coincide 
while the sp axis makes an angle a with respect to the x (and s;) 
axis. The element location is defined by two indices (p, g) correspond- 
ing to a physical location 


Ore = pbs, + qd8, (1) 


where §, and 82 are unit vectors along the s; and s, axes, while b and d 
represent the basic periods of the two dimensional grid. A basic 
periodic cell,15 the parallelogram shown in Fig. 1, is thereby defined. 
If the array elements are excited uniformly in amplitude with a linear 
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Fig. 1— Circular waveguide array geometry. 
phase taper such that the phase of the (p, q)th element is 


Vog = DY. + Qt, » (2) 


then the resulting electromagnetic fields in the (p, q)th and (m, n)th 
periodic cells satisfy the following periodicity relationship 


E(0¢) = E(@mn) exp [¢ {(m — p)i, + (n — ¥x,3] (3) 


where E(o,,) may designate the electric or magnetic field at the (p, q)th 
periodic cell of the grid.* Therefore, except for a phase factor, the fields 
in all the cells are identical. 

In order to solve the boundary value problem, the exterior (free space) 
fields are expressed in terms of a complete set of Floquet type solutions 
of Maxwell’s equations { W?,, exp + 7B,,,2}. These vector modes, which 
are functions of the steering phases y,, and y,, are derived in Appendix 
A. The interior (2 S 0) fields are expressed in terms of the appropriate 
waveguide complete orthonormal set of vector modes {®; exp + 7Tj2}.7 
The boundary value problem is expressed in terms of an integral equa- 
tion which includes the necessary continuity conditions. This equation 
is formulated’ by satisfying the continuity of the transverse (to 2) 


* The parallelogram in Fig. 1 defines the (p = 0, q = 0) cell. The (p, q)th cell 
is translated by pb and qd along the s: and sz axes, respectively. 

t The waveguide modes are real functions and in general consist of both TE 
and TM modes with double subscripts. However, one can always systematically 
relabel these modes with a single subscript according to the increasing values of 
the eigenvalues, 
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electric and magnetic fields within a single: periodic cell. As shown in 
Appendix B, the periodic cell consisting of the parallelogram CDEF of 
Fig. 2 can be replaced, without a loss of generality, by the parallelogram 
GHIJ (or any other periodic contiguous cell containing a complete 
single waveguide aperture). 








Lp < : . 
D 


Fig. 2 — Periodic cell in skewed array geometry. 


A 






The tangential electric and magnetic fields at the array interface 
(E and H at z = 0) can be expressed in terms of a Fourier series of the 
complete orthonormal set of waveguide modes {@;} for z S$ 0 and by the 
set of Floquet type modes {¥W”,} for z2 = 0. Let the waveguides be 
excited by any linear combination of their propagating modes* with 
amplitudes A;(7 = 1, ... , J for J propagating modes) and let the 
coefficients R; represent the amplitudes of the corresponding reflected 


*It is straightforward to include, if desired, any linear combination of both 
propagating and evanescent modes, 
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modes.* Let the coefficients D; represent the amplitudes of the reflected 
evanescent modes which are generated at the aperture. Then, in terms 
of the waveguide modes, the electric field at 2 = 0 is given by 
J 0 
>> (4; +R) ®; + DS De; (over the waveguide aperture) 
jal+l (4) 


E= ji 
0 (over the rest of the periodic cell). 


The corresponding magnetic field is 


oO 


J 
» YA; — RB)’; — Dd) Y;D,®, 
7=1 7aJ+l 


i t 
= ae (over the waveguide aperture) 5) 


0 (over the rest of the periodic cell). 
where the modal admittances {Y;} are real for propagating modes and 
pure imaginary for evanescent modes. These admittances are given by?® 
T; € 


Y,;=— for TE modes; Y, = os for TM modes (6) 


= 


for an exp [—iwt] time convention with the I, (the z propagation con- 
stant) being positive imaginary for evanscent modes. The tangential 
electric field at 2 = 0*, expressed in terms of the Floquet type modes, is 


2 
EJs Do Pa (over the periodic cell) (7) 
p=1 (mm) (n) 


where the superscript p designates TE(p = 1) or TM(p = 2) modes. 
The magnetic field is correspondingly given by 


2 
—H, = SS SS » Dg otal aon (8) 


p=1 (m) (n) 


where the modal admittances Y/,,, are given by 


Vi Ses “yes Sees: (9) 


? mn2 
Wu Bias 
From the orthonormality of the sets {®,} and {W?,,} it is clear that 


0) = If hee: dao for jsJ (10) 
A D; jo 





* Actually {Rj} can represent the reflection coefficients once {Aj} are properly 
normalized, 
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where A is the waveguide aperture, and 


g.,we) = ff E.crnytda= ff B.-OrE)* da = Pay GD 
A 


periodic cell 


where the symbol * denotes the complex conjugate. Notice that E, 
vanishes on the conducting ground plane. To insure the continuity 
of the tangential fields across the aperture at z = 0, one requires 


E, = E_ = E, over the aperture and the periodic cell (12) 
while 


H, = H_ = H, over the aperture only. (13) 


Using the various relations, (4) through (13), one obtains an integral 
equation for the tangential electric field E, at the array interface 


J ) 
2 > A;Y,®; — > Y ,®; [[ ©,-E. da 
7=1 7=1 <i 
A 


2 
t+ ULV VD viwwr ff OB da. (4) 
p=1 (m) (n) fe 

Similarly, one can obtain’ an integral equation for H, which is defined 
over the entire periodic cell. Under certain conditions’’ it is possible to 
interchange the order of summation and integration in (14) and thereby 
obtain the usual form of Fredholm integral equation of the first kind. 

A useful method of obtaining a solution to (14) is by the application 
of the Ritz-Galerkin method,** whereby the integral equation is reduced 
to a linear matrix equation. Substituting (4) in (14) for E, and taking the 
moments of (14) with respect to the set {®,}, while using (10), leads to 
the following matrix equation 


Y,A, A, + R, 
2 Y,Ayz = | K | Ay, = Ry; : (15) 


0 Dy+1 
where | K| is a square matrix with the (2, q)th element given by 


2 
bg Vi Ge Dy DD, Vi CaniCong (16) 


p=l (m) (n) 
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In (16) 8,, is the kroenecker delta 


big = f aa (17) 
ie ae 
and 
C2, = if | Ww? ., da (18) 
A 


is the coupling coefficient between the designated interior and exterior 


modes.* 

The practicality of an accurate numerical solution or approximate 
analytical solution often hinges upon obtaining C%,,; in closed form. 
Recently the authors’ have obtained closed form expressions of these 
coupling coefficients for circular as well as coaxial waveguides. 

A solution for the aperture field in terms of the coefficients of the 


waveguide modes is given by 


A,+ Rk, ALY, 
° A, +R, eae eu ie A,Y,; . (19) 


Dys1 0 


The solution vector can be obtained by matrix inversion or by rapidly 
convergent iterative methods.”’”° A similar procedure should be followed 
to obtain H, except that the aperture field and the moments may be 
taken with respect to the set { W?,,}. 

Once the aperture electric field is obtained, the input impedance 
and radiation properties of the array, as a function of scan, are easily 
obtained. The reflection coefficients are obtained directly from (19), 
as are the amplitudes of the evanescent modes in the waveguides. The 
radiation pattern of a single element in the array environment, includ- 
ing its polarization characteristics, can also be easily obtained.? 7? 

The addition of either a dielectric sheath or plug or both to the 


* Notice that other sets of functions {£:} can be used to reduce (14) by the method 
of moments. However, the integrability of 
Sf cay Bis and Sf cay Wan? + Er 
may prove difficult, depending on £;. The convergence properties as a function of 
the order of K will also be influenced by the choice of {£7}. 
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array, Fig. 3, does not alter the functional form of the integral equa- 
tion (14). As was shown by Galindo and Wu,??? only the modal 
admittances in (14) have to be replaced. For the case when dielectric 
plugs are used, Y; is replaced by 


Y; — ty; tan y;t 


Yi Ui y; — 1Y; tan vit 


(20) 
where y; and 7; are the modal admittance and propagation constants, 
respectively, of the ®; mode in the dielectric and ¢, is the dielectric plug 
thickness. The relation between the reflection coefficients of the propa- 
gating modes in the dielectric-free region (€)) and the aperture field is 
given by 


Yi i E,-@; da — A; exp(—2Tt,)[y; cos y;t, + 7Y; sin y;t,] 


R; = A : . ae ty 
exp(T,t,)[y; cos y;t, — tY; sin 7;t,] (21) 
the phase of R; being referred to the aperture. Similarly, for the 
exterior dielectric sheath covering the array, the modal admittances 
are replaced by 

gone ae Vissi tan Bmntt 


/ , 
: gane = Ymnp 7 


Ynnp OY snp tan Bion é 22) 


where y/’,,, and Bn, are the modal admittance and propagation constants, 
respectively, of the W%,, mode in the sheath, and ¢{ is the sheath thick- 
ness. The coefficient of the mode W”,, in the free space region above the 
sheath, Finny » 18 related to the aperture field by the following relation: 


exp(=7Brnti)Yinn if 
FS Pn Oe ee Pe Pe Se ba ae 7 E, .VupPr* 
eee Ymnp cos Bb? — ty sin Bact 1 ao da, (23) 
periodic 
cell 


the phase of F,,,, being referred to the aperture plane. 

The integral equation formulation can be extended to the case when 
thin metallic irises are present at the waveguide aperture for matching 
purposes (see Fig. 4). The integral equation for the aperture electric 
field, (14), is still valid except that the integral has to be defined over the 
effective aperture with the result that the orthogonal relations of (10) 
and (11) cannot be used for the Ritz-Galerkin method of solution.* 
The modal coupling coefficients, (18), are still integrable in closed form 


* An integral equation for the magnetic field is not valid in this case because of 
the discontinuity of H, across the iris. 
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Fig. 3 — Dielectric sheath and plug geometry. 


over the effective aperture if the iris is circularly symmetric. However, 
the matrix elements, as given by (16), do change in form when an iris is 
present at the aperture. As a result of the ®; not being orthogonal over 
the new effective aperture, the term Y;6;, in (16) is replaced by an infi- 
nite sum. 

Multimode excitation of waveguide antenna fields has been used for 
primary pattern control. Such excitations may prove useful for the 
reduction of mutual coupling effects.” They also may be used to obtain 
circular or elliptical polarization from two linearly polarized modes and 
improve the polarization characteristics of the array. In the circular 
waveguide array, the horizontal TE,, (®,) and vertical TE,, (®,) modes 
are degenerate (that is, they have the same z-directed propagation 
constant and impedance). In order to obtain a linear, elliptical or 
circular polarization excitation of the waveguide, one may redefine the 
first two modes as ®,y and @yy: 


A 


1 = Ay 
(| Ai P+ | Ae |)’ 


lo A i ee 
(AAs 





Div = ®D, (24) 


EFFECTIVE 7 THIN IRIS 
7 


APERTURE, , 


/ 


Fig. 4— Aperture iris geometry. - 
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_ | Az | - A¥ | A, 

Pe = “CAP + [ae PE ARTA P+ 4 BD 
This redefinition of the first two modes preserves the orthonormality 
and completeness of the set of waveguide modes while allowing the 
flexibility in adjusting the desired array excitation. The reflection 
coefficients which correspond to these redefined modes are Riy and 
Roy. The polarization characteristics of the radiation pattern may 
be determined from the 6 and ¢ components of the radiated field, 
Eo and Ey, respectively. After proper normalization of Hg and Ey one 
may obtain the corresponding transmission coefficients?® 


r= (Se) aa ff Eecrior aa 6 


(aperture) 


~ (Ee) i E,-(w,)* da (27) 


(aperture) 


5h, (25) 





T's 


where 7’, and 7’, define the scan angle directional cosines. When the 
first two waveguide modes are the only propagating ones and while 
only a single lobe propagates in the free space, equations (24) through 
(27) are related by the conservation of energy relation: 


| Riv P + | Bow + | To [+ | 7 = 1. (28) 
For more than one lobe in free space or additional propagating modes 
in the waveguides, (28) has to be accordingly modified. 


III. NUMERICAL AND EXPERIMENTAL RESULTS 


In order to obtain a numerical solution for the aperture field, the 
infinite dimensional matrix of equation (15) must be truncated and 
cast in a finite dimensional form. In other words, the electromagnetic 
fields will be approximated by a finite Fourier series of the waveguide 
and free space modes, and consequently the solution of the problem 
as given by (19), is finite dimensional as well. 

In numerical solutions of problems of this type, various ways of 
ascertaining the validity and accuracy of the solution are desirable. 
One obvious way is to increase the number of waveguide and free 
space modes and check the convergence of the solution as a function 
of the number of modes used in truncating (15). However, for the 
type of kernel involved in this problem, monotonicity of the conver- 
gence is not assured. Nevertheless, convergence is an important check 
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since this numerical solution is variational or stationary for the im- 
pedance.’” 74 Iterative methods and error estimates® 7° may also be 
used for checking the convergence of the solution. Special symmetries 
of the reflection coefficients versus the scan angles which are dictated 
by the array geometry and mode of excitation can serve as a semi- 
independent check. An important independent check in which the 
reflection and transmission coefficients can be measured at special 
scan angles with the aid of a waveguide simulator is used as well. 

The numerical results will in general be presented as a function 
of scan angle. For convenience, however, the differential phase shifts 
between elements will be used as the independent variables. These 
quantities are y, in the x-direction and yw, in the y-direction. Further- 
more, since we limit ourselves to radial planes of scan, we introduce 
the quantity y,. These quantities are related to the directional cosines 
as follows: 


27rb 27rd sin r 
eels We FSB; v= + DL. 29) 
The amount of computation can be reduced when one recognizes that 
the following symmetry in the aperture field as a function of scan 
exists: 


E, (yz, WY) = E, (—¥-, —y,). (30) 


Convergence tests as a function of the number of waveguide and 
free space modes indicated that 18 waveguide modes and 388 free 
space modes yield several percent (usually less than 2 percent) 
accuracy in the magnitude of the reflection coefficients, R;, except 
near sharp changes of R; where the position of the sharp changes is 
accurate to several degrees in y,. 

The energy conservation check, equation (28), is a necessary check 
but not a sufficient one in this problem as well as in other interior 
type boundary value problems.*” 2 33 

One of various special symmetry checks is depicted in Fig. 5. A square 
grid array in the (x, y) coordinates is excited by the vertical TE,,(@,) 
mode. In this coordinate system the array parameters are represented 
in the following way 
a = waveguide radius; b=d; az oe (31) 

Diy = ®, ; Po2y = —9, 


The parameters of the same array, when viewed in the (2’, y’) coordi- 
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Fig. 5 — Symmetries in square grid arrays. 


nate system, can be alternatively represented as 


—_— yr a. a ° 
b (2)?d (2)*d; a 45 (32) 


1 1 
Py = Op {—@; + ®2}; Dy = (2 {Di + @3} 


where the reference to the (xz’, y’) system is denoted by primes. The 
same results should be found for this array at any scan direction 
regardless of the representation. This offers, to a degree, a check on 
roundoff error. Numerically, the reflection coefficients differed only 
by a fraction of a percent. 

An additional symmetry check is given in Fig. 6, where the magnitudes 
of the reflection coefficients are plotted versus y, for a 45° plane of scan. 
Ry and Ry correspond to the reflection coefficients of ®:y and @®zy as 
defined by (81). At ¥, = 240° (shown by a vertical arrow) the main beam 
is grazing, while for Y, > 240° no beam exists in real space and the total 
incident power is reflected and divided between the two propagating 
modes, ®,y and @.,. Of special interest is the point y, = 180° 
X (2)! ew 255°. At this point ¥, = ¥, = v1, = ¥:, = 180° and the array 
excitation is as indicated in the inset of Fig. 6. If the array is to be 
simulated at this scan angle, the appropriate waveguide simulator would 
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Fig. 6— Square grid array: |Riy| and Ai ov| vs wr in the 45° plane of scan. 
a = 048,b =1,d=1,\=1.4 and a = 90° 


consist of the square waveguide (solid lines) shown in the inset. It is 
clear that the horizontal waveguide mode (®,y) cannot be excited. The 
numerical results indeed show that | R.y | = 0 at this scan angle. 
Figure 7 shows a close agreement between experimental and numeri- 
cal results for a rectangular grid array with vertical polarization ex- 
citation scanned in the H-plane. The scan angle which corresponds to 
steering phases y, = y,, = 180° and y, = ¥,, = 0, can be simulated 
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Fig. 7— Rectangular grid array: computed and experimental results vs fre- 
quency at H-plane symmetry point. a = 3.57 cm., b = 16.51 cm., d = 8.261 cm., 
and a = 90°, 


1916 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 








Fig. 8— Rectangular grid array: simulator for H-plane symmetry point. 


by the rectangular waveguide (solid line) in the inset. The experi- 
mental results were obtained from measurements of an abrupt junc- 
tion between a circular waveguide and an L-band rectangular wave- 
guide as shown in Fig. 8. 


IV. EQUILATERAL TRIANGLE GRID ARRAYS 


Let us consider the reflection and radiation characteristics of cir- 
cular waveguide arrays arranged in an equilateral triangle grid, and 
the strong dependence of the array properties upon the mode of 
excitation. Grating lobe incipience or a beam at grazing is designated 
by a vertical arrow in the illustrations, 
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Figure 9 shows the reflection coefficient of an array in the #-plane 
scan, with vertical polarization excitation. In this plane the hori- 
zontal mode is not excited because of symmetry so that Rey = 0. As 
can be seen, the slope of both the magnitude and phase of Ryy is dis- 
continuous and singular at a grating lobe incipience, which parallels 
previous observations in rectangular waveguide arrays.2® 2? This is 
related to the asymptotic decay of the coupling coefficients. A forced 
surface wave resonance can be seen around grating lobe incipience 
where | Riy| = 1.0. Notice that this forced surface wave resonance 
is extremely sharp and consequently may not be observed in small 
finite arrays. 

The corresponding transmission coefficient 7’) is shown in Fig. 10. 
The plot of the transmission coefficient is actually the radiation pat- 
tern of a single element in the array environment and it exhibits the 
null which corresponds to a total reflection. Notice that the phase 
of the transmission coefficient will exhibit a 180° discontinuity when 
the magnitude has a zero. The magnitude of the reflection coefficients 
of the same array, in a 60° plane of scan, are shown in Fig. 11. Again, 
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Fig. 9 — E-plane scan of Riv vs y+ (a = 0.48, 
b=1,d=1,\=—14anda= 60°). 
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Fig. 10 — H-plane scan of To vs vr (a = 048,b = 1,d=1,\ = 14, and a = 60°). 
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Fig. 11— [Rav] 3 and Aad vs wr in the 60° plane of scan (a = 048,b = 1,d=1, 
\ = 14, and a = 60° 
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the singular slope at grating lobe incipience can be observed. As can 
be seen, the distribution of the reflected power between the two modes 
is a function of the scan angle. 

An interesting phenomenon can be observed in the corresponding 
transmission characteristics which are shown in Fig. 12. The ¢ trans- 
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Fig. 12 — [Tol a and. ta vs Wr in the 60° plane of scan (2 = 048,b = 1,d=1, 
A == 14, and a = 60° 


mission coefficient, 7’, vanishes prior to grating lobe incipience (the 
positions of the vertical arrows). The vanishing of 7’, at this scan 
angle can be directly related, when coupled with the vector symmetries 
in the array excitation and geometry, to a forced surface wave reson- 
ance. If the array excitation consists of the sum of the two modes 
(equal in phase and magnitude) indicated by the solid arrows in 
Fig. 18, then in the 60° scan plane Ty = 0 by symmetry considerations 
and Ty vanishes as shown in Fig. 12. Since the vectorial sum of the 
two solid arrows in Fig. 18 is the dashed arrow, zero transmission or 
a forced surface wave resonance will occur in the H-plane of this 
polarization. Figs. 14 and 15 indeed show this effect in the H-plane 
scan where | Riy | and | T4| attain unity and zero respectively prior 
to grating lobe incipience.* 

Since the forced surface waves are related to the vector symmetries 
just mentioned, one may anticipate that the scan points at which they 
occur are isolated. Figure 16 indeed demonstrates that the forced reso- 
nances in the / and H planes occur at isolated points. The scan around 


* The difference in the values of ¥, at which these phenomena occur is inherent 
in the definition of y,, equation (20). 
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Fig. 13 — Vector symmetry relationships between transmission zeros and forced 
surface waves, 








216 











144 





cx 
fu 


Zin IN DEGREES 











(eo) 





t 
~N 
fh 

















-144 
0 36 72 {08 {44 180 216 252 288 324 360 


wy, IN DEGREES 


Fig. 14 — H-plane scan of Ray vs yr (a = 048,b =1,d—=1,4 = 14, anda = 60°). 
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Fig. 15 — H-plane scan of Ty vs Yr (a = 048,b = 1,d= 1, = 14, and a = 60°). 


the grating lobe circles in Fig. 17 shows that the peak of the total reflec- 
tion | Rr | = (| Riv |? + | Rey |”)? varies from unity in the H plane 
(6 = 0), gradually decreases, and then increases again and reaches unity 
in the E-plane (6 = 90°). The anomalous behavior near grating lobe 
incipience is eliminated when the polarization of the excitation is changed 
to horizontal, as shown in Fig. 17, indicating thus the strong dependence 
of the forced surface wave resonances upon the mode of excitation. Over 
the frequency band given by 1.8 S$ A S 1.5, qualitatively similar 
behavior of the radiation and reflection characteristics of the array was 
observed. 

Figure 18 shows the reflection characteristics of the array under 
circular polarization excitation. In this case 


1 


~ (2/8 {@, — 7@,}. (83) 


®.y = ©, + 1,}; ®,y = 


uals 

(2) 
The incident mode is ®,y. 

Again the singular slope of these curves at grating lobe incipience 
can be seen. The division of the reflected power between the two 
modes as a function of scan may be observed as well. Fig. 19 shows 
the polarization characteristics of the radiation pattern of an array 
element. The axial ratio, denoted as A.R., is the ratio of the minor 
to major axis of the polarization ellipse while the tilt angle of the 
major axis, 7, is taken with respect to the ¢ axis. As indicated by 
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Fig. 16 — |R2| vs y+ in the vicinity of grating lobe incipience. 
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Fig. 17— #- and H-plane scans of |R.y] vs ¥,- for horizontal polarization (a = 
048,b=1,d=1,A\—=14, and a= 60°). 
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Fig. 18 —|R.v| and |Rey| vs yr in the 0° plane of scan. Circularly polarized ex- 
citation (a = 048,b = 1,d=1,\ = 14, and a = 60°). 


the plot, the element (or the array) far field pattern is circularly 
polarized around broadside, A.R. = 1.0. It deteriorates to linear 
polarization, A.R. = 0, at two points prior to grating lobe incipience. 
The linear polarization at y, = 203.5° with + = 90° results from the 
H-plane forced surface wave resonance of Figs. 14 and 15 where 
Ty vanishes. The null of the axial ratio at ¥, = 207° (with r = 105°) 
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Fig. 19 — Radiated axial ratio (A.R.) and tilt angle (7) vs ¥, in the 0° plane 
of scan. Circularly polarized excitation (a = 048, b — 1, d = 1, X = 14, and 


a = 60° 
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is caused by the difference between the phases of Ty and TJ’, causing 
the ¢ and @ components of the far field to be in phase. 

The reflection and polarization characteristics in the 30° plane of 
scan are shown in Figs. 20 and 21, respectively. In this case the 
single null of the axial ratio at grating lobe incipience results from 
the H-plane forced surface wave resonance of Figs. 9 and 10. For 
planes of scan between 0° and 380° the results corresponding to one 
plane change gradually to those of the other plane. Around grating 
lobe incipience the axial ratio drops appreciably (to around 0.1) but 
does not reach zero. From the circular symmetry of excitation and 
six-fold symmetry of the array geometry, a 30° sector of scan com- 
pletely specifies the array reflection and radiation characteristics. 


Vv. CONCLUSIONS 


A general formulation of the planar phased array boundary value 
problem has been given in terms of a vector two dimensional integral 
equation. The solution of this equation by the Ritz-Galerkin method 
closely agreed with experimental results. 

Equilateral triangle phased arrays of circular waveguides were 
numerically analyzed. It was found that forced aperture resonances 
or forced surface waves, manifested by total reflection and no radia- 
tion, do exist for these arrays even in the absence of dielectric ma- 
terials. These effects were observed over a 15 per cent frequency 
band. The forced aperture resonances occurred prior or close to grat- 
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Fig. 20 — |Rix|] and |Rey| vs ve in the 30° plane of scan. Circularly polarized 
excitation (a = 048,b = 1,d= 1.4 = 14, and a = 60°). 
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Fig. 21— Radiated axial ratio (A.R.) and tilt angle (r) vs Wr in the 30° plane 
ofa polarized excitation (a = 048,b = 1,d = 1, \ = 14, and 
ing lobe incipience in the # and H plane of scan for vertical polariza- 
tion excitation. These resonances were found to occur at isolated 
points as a function of the scan variables and are strongly influenced 
by the mode of excitation. The resonances vanish when the polariza- 
tion of excitation changes from vertical to horizontal. 

The polarization characteristics of the radiation pattern (or al- 
ternatively the radiation pattern of a single excited element in the 
array environment) is shown at selected planes of scan for circular 
polarization excitation. The degradation of the axial ratio resulting 
from the forced surface waves was shown. Total reflection or no 
transmission owing to forced aperture resonances were not observed 
for circular polarization excitation in the cases presented. 

The analysis of coaxial waveguide arrays, as well as the incorpora- 
tion of thin, circularly symmetric irises in the aperture of the wave- 
guide element, can be carried out along lines similar to those dis- 
cussed here. 

The effects of dielectric loading of the array as well as dielectric 
covers and plugs have also been studied.34 
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APPENDIX A 


Floquet Type Wave Functions in Skewed Coordinates 


Consider the periodic array, Figs. 1 and 2, excited with incremental 
phase shifts (steering phases) between adjacent cells along the s, and 
Sp coordinates. A complete set of solutions to the scalar wave equation, 
each of which varies periodically, according to Floquet’s theorem, 
along the s, and sz coordinates is 





sit eades Tv, _ nm | | Pe = 2 
Sma = @Xp (tBin2) EXP | 7 7 fs exp 2) “a q_ |S (34) 


with the integers m,n = — ©,... —1,0,1,2,3,..., + ©. Equation 
(34) describes a wave traveling (or decaying) in the z direction with 
propagation constant B,, (exp —iwt time convention). The steering 
phases y,, and y,, are directly related to the beam pointing direction, f, 
of a radiated plane wave with a vector propagation constant k, = kof’, 
so that (34) can be rewritten as 


Sinn = @XP (tBinnZ) EXP i 6 — 2a, exp i kas — 2a |, . (85) 


The free space propagation vector ky can be expressed in the cartesian 
coordinate system as 


k, = kl T.4 + 7,9 + TA (36) 


where TJ, , T, and T, are the directional cosines of k) with respect to 
that system and #@, #, and 2 are the unit vectors. The quantities 





k 6 = 2s and kp ote (37) 
b d 

are the projections of ky on the reciprocal grid (lattice) coordinates ¢, 

and t, , respectively.’’’’*-°° The unit vectors in the ¢, and #, directions 

form a biorthogonal set with §, and & (Fig. 22). To express (35) in 

cartesian coordinates it can easily be shown that 


Ss, = x — y cota, S. = y/sin a, (38) 
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Fig. 22 — Grating lobe diagram in reciprocal lattice coordinates. 


so that the substitution of (88) into (35) yields 


Sma = exp (Broz) exp i wer. — 2am |, 


“exp i] ker, — ( eats am Vy (39) 


dsina bdtanae 





The propagation constants of the (m, n)th Floquet mode along the 
x and y directions, k, and k, respectively, are 
k, =k-¢ = oT, — 72 ; 
(40) 
Qnn 2am ). 








ky = k-g = bol’, — (<=. b tana 


Since Sn, is a solution to the scalar wave equation, it can be shown 
that 
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= (ke — I? ~ 12)? 


(41) 
a ee = dem = | ee ( 2mm 2am be 
a (i Ee b Krol, dsna ObDtana 


where the positive imaginary root holds for (k2 + k?) > kj (time 
convention exp —2wt). 

Each mode, Sn, for which Brn is real corresponds to a radiated plane 
wave of the phased array. The plane wave with the indices m = 0 and 
n = 0 is identified with the main beam while m or n # 0 corresponds to 
a radiating grating lobe. As a function of 7, and T, (or %, and y;,), a 
given B,,, may become pure imaginary as it goes through a zero, as in 
equation (41). In such cases, the related Floquet mode, S,,,, becomes 
evanescent or nonradiating. By plotting the curves obtained by setting 
the B,,, = 0 as a function of 7, and T, , one obtains a convenient dia- 
gram which illustrates these effects. Setting B,,, = 0 yields 


Mo > Xo MXo ; 
7. —% | a [7. Cre me) | i 
where Ayo = 2xr/ko. As a function of 7, and T,, (42) represents a 
family of circles with unit radius displaced from the origin. This 
diagram of displaced circles constitutes the well-known grating lobe 
diagram, Fig. 22,31 8? 
Notice that the steering phases, y; and yi2, are related to 7, and 
T, through equations (86) and (37). The parallelogram C’D’H’F’ of 
Fig. 22 corresponds to the range of steering phases 


oar Ss Pp SS or; Sa SS YS ar (43) 


and is a periodic cell along the ¢, and ¢, coordinates. 

As mentioned in Section IJ, it is possible to define a complete ortho- 
normal set’ of vector modes {W?,,} over the parallelogram CDEF, 
Tig. 2. The tangential electromagnetic field at the plane z = 0° can be 
expressed by a Fourier series of this set of modes which consists of both 
TE and TM modes (transverse to z). These modes, {72} and {w2™} | 
are given by 











rE _ exp u(rk, + yk,) {hee inate i} i 
‘Ene Odsina We Be led (44) 


and 





wt Sxp Hck F yhy) {hs ¢ + : | (45) 


(bd sin a)? 





m,n 


CIRCULAR WAVEGUIDE ARRAYS 1929 


with k, = (k2 + k?)*. The quantities k, and k, are functions of (m, ) as 
given by (40). The orthonormality of this set of vector modes is defined 
by the following scalar products: 


(Wns Woe) = i} Win CBee )* da dy = Snnva» (46) 
ee 
where 
Be on : for m=p and n= aq, (47) 
0 otherwise 
4! Se ; wr Y= 0 (48) 
and 
(Wan i oe) = Omnis (49) 


APPENDIX B 


On the Invariance of the Scalar Product with 
the Shape of the Periodic Cell 


The orthonormality and completeness of the set {W%,,} of Floquet 
modes, equations (44) and (45), need not be defined over a specific 
periodic cell such as the parallelogram CDEF of Fig. 2. This fact is 
especially significant when a periodic cell intercepts parts of more than 
one circular (or other type of element) aperture. It will be shown that 
the orthonormality of {4?,,} can be preserved over a properly deformed 
periodic cell which contains only a single waveguide aperture. 

Using (37), equation (44) can be rewritten as 


Win = F(m,n) exp i] Yo = 2am 8. + tu 2m, | ; (50) 
-The scalar product betewen two TE modes is 
(Pres Woe) = dnn.ve 
[fet de ay 
(CDEF) (51) 
I We COB)* sin a ds; dse 


(CDEF) 


I 


l 


F(m, n)-F(pq)* sin a 


I 
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ff exp — {| xin — 2) 5 + ORD, | as, ase. 


(CDEF) 
The integral over the parallelogram CDEF in (51) can be divided 
into three (or more) integrals over the triangles CGK and LEI and 
the polygon GDLIFK (Fig. 2): 


(-+ i+ If. on 


CDEF CGK LEI GDLIFK 


Because of the array periodicity, the triangles DHL and LET are 
displaced by b with respect to the triangles CGK and KF J, respectively, 
along the s, direction. Thus, for example, if the s,; coordinate of the points 
within the triangle DHT is s{ given by 


s=s, +0 with dsi = ds, (53) 
then 


I exp — |? oe Pp) si + 2efn = 9) s| ds}, ds, 


DHL 


= exp —i27(m — p) / if exp — i an P) s, + ns ”) s | as, ds. 


CGK 


-| i exp — {| tm =) = PD , it Pen ot s| ds, dso . (54) 


CGK 


Similarly 
i | exp — | im — me P) s, + Zu) s| ds, ds» 


= |/ re i atm =v — p) es ae | us, dS. (55) 


KFJ 
Thus 
i / exp — {| etm = 9) =P) 8s + sent s| ds, ds. 
CDEF 


=| i exp — i]? ee P) 8, + vain = s | ds, ds» (56) 


GHIJ 
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and the orthonormality and completeness of the set { &?,,,} is preserved 
over the new periodic cell GHIJ. This is the two dimensional analog of 
the single dimensional Fourier series whereby the functions and coeffi- 
cients are independent of the initial value of the period. In fact, the two 
dimensional periodic cell can be deformed to any shape which contains 
a single waveguide provided that the area of the cell stays the same and 
the parts of the cell which cause the deformation are translated by b or d 
along the s, and s, coordinates, respectively. 
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The Effects of Strain on Electromagnetic 
Modes of Anisotropic Dielectric 
Waveguides at p-n Junctions 


By JAMES McKENNA and J. A. MORRISON 


(Manuscript received May 22, 1968) 


A first order perturbation expansion is carried out in order to analyze the 
effect of small spatially uniform strains on the lowest order (even) TE and 
TM modes in an anisotropic dielectric waveguide. This generalizes the 
results of an earlier paper in which the effects of certain special cases of 
uniform strain were calculated. Unlike in these special cases, the perturbed 
modes are, in general, neither purely TE or T'M, and one effect of two of the 
offdiagonal components of the strain is to tilt the plane of polarization and 
change the relative phase of the two polarizations. To first order, the modes 
are not exponentially attenuated. Some numerical examples are considered 
in order to illustrate the results. It 1s found that, under appropriate condi- 
tions, the effect of the small strain may be quite large in relation to tts 
magnitude. 


I. INTRODUCTION 


The concept of a multilayered dielectric waveguide is central to 
the theory of the GaP electro-optic diode modulator. As part of a 
detailed study of the properties of electro-optic diode modulators, 
Nelson and McKenna‘ have investigated the possible discrete modes 
which can propagate in a number of such waveguides and have cal- 
culated the detailed properties of the lowest order mode of each 
polarization. 

In the fabrication of a p-n junction a certain amount of strain is 
always introduced. Because of the photoelastic effect?® this strain will 
induce a change in the dielectric matrix describing the unstrained p-n 
junction. In general the strain will be spatially nonuniform, making 
it extremely difficult to calculate modes in such a structure. However, 
a knowledge of the effect of a spatially uniform strain on the mode 
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structure would provide insight into the effects of nonuniform strain. 
The effects of certain special cases of uniform strain on the modes of 
a simple model of a dielectric waveguide were calculated in Ref. 4. 
In the present paper we complete this investigation and calculate the 
modes in the same model dielectric waveguide when subjected to an 
arbitrary uniform strain. We use first order perturbation theory in a 
small parameter describing the magnitude of the strain. 

Although the work presented in this paper was motivated by re- 
search on the theory of the electro-optic diode modulator, the results 
have considerable relevance to the theory of the GaAs injection laser. 
Here too, various dielectric waveguide models have been used to 
explain the light containment.**-*4 The same problems of strain exist, 
and the results of this paper give a qualitative picture of the effects 
of strain on modal structure. The effect of strain on completely 
different types of electro-optic light modulators have been studied by 
Kaminow? and by DiDomenico and Anderson.** 


II. FORMULATION OF THE PROBLEM, AND RESULTS 


In Ref. 4 the symmetric step model was used to study the effects 
of strain. This very simple model exhibits many of the main features 
of interest in dielectric waveguide models. 


The model consists of an anisotropic crystalline slab bounded by the 
planes x = -& w, whose refractive index is raised uniformly by some 
constant amount, the physical origin of which is still obscure, and which 
is embedded in an isotropic medium of relatively lower index of refrac- 
tion. The central slab represents the junction region whose anisotropy is 
caused by the junction field E, acting through the electro-optic effect.” 
The direction of the x-axis is always taken parallel to E;. The isotropic 
medium represents the normal GaP. 

The model is determined by its dielectric matrix, which in the absence 
of strain and for certain orientations of E, with respect to the crystal 
axes can be diagonal in a coordinate system having its x-axis parallel to 
E,. For such orientations of E,, the diagonal matrix elements of the 
dielectric matrix K(x), a = 1, 2, 3, depend only on x. (We use 2, y, 2 
for the coordinates rather than 2;, 2X2, ¢3-) The matrix elements in the 
absence of strain are then defined by the equations 


KO (2) = K. , lz|<w (1) 
. K?@=K, l2|>w (2) 
where a = 1, 2, 3 (see Tig. 1). 
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¢ 
Ke (x) 





Fig. 1— (a) The coordinate system used in the symmetric step model. (b) A 
graph of Ka(z). 


There are two orientations of E, of particular interest which allow us 
to diagonalize the dielectric matrix in the desired coordinate system. 
If E, is in the [111] direction, then the x, y, and z axes can be taken in 
the [111], [110], and [112] directions, while if E, is in the [100] direction, 
the x, y, and z axes can be taken in the [100], [011], and [011] directions. — 
The set of axes determined by the unstrained model will be used in all 
the strain calculations and the dielectric matrix will always be referred 
to these axes. See Ref. 4 for further details of the model. 

In the presence of a uniform strain, the dielectric matrix is in 
general no longer diagonal, and we can write for the dielectric matrix 
elements Kug(x), 


Kid) = KO (at) + Baa, @ = 1,2,3 (3) 
K a(x) = Sap ’ a B. (4) 


The symmetric matrix (7Sag) is the contribution of the photoelastic 

effect?® which we have written in this form for convenience in the 

perturbation analysis. The matrix elements Sag are spatially constant. 

We assume that n°Sag is of order unity, where n is the index of refrac- 

tion of GaP and 7 is a small parameter. In Section II we express 7Sag 

in terms of the strain matrix and give estimates for the size of 7. 
We now seek solutions of the Maxwell curl equations 
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V X 4H = ¢K(2)-E, (6) 

of the form 
E = e(z) exp 7@t — kz), (7) 
H = h(z) exp i@t — kz). (8) 


These solutions correspond to waves travelling in the positive z direc- 
tion, where k = w(uoéo)? = 2m/d is the free space wave number and ) the 
free space wavelength of the light. 

In the strain free cases (7 = 0), there are both TE and TM modes 
and these modes can be either even or odd functions of x. At most 
only a finite number of modes can exist, and Ref. 4 shows that for the 
typical parameter values encountered in GaP diode modulators only 
the lowest order even TI; and TM modes can exist. For that reason 
we confine ourselves here to solutions which in the limit of zero 
strain (y = 0) reduce to even modes. However, the perturbation 
technique used here applies equally well to solutions which in the 
limit » = 0 reduce to odd modes, 


When 7 ~ 0, we seek solutions of Maxwell’s equations of the form 


€.(t) = A, exp —kp(x — w) 


IIV 


+ B, exp —kq(x — w), x=w (9) 
= C, expkr(xa + w) + D, expks(x +w), «cS —w (10) 
= I, exp tkfx + G. exp —thgx 
+ LZ, exp tklz + M, exp —ikmz, la|sw (11) 
for a = 1, 2, 3. The general solution in each region is a sum of four 
linearly independent solutions, but in the regions | z| > w, the bound- 


ary conditions at infinity eliminate two of these solutions. The expres- 
sions for ha(x) can be obtained from equation (5). The various coef- 


IIA 


ficients and parameters Ag... ,p,... can be expanded in powers of 7 
Ay = Ay Aa’ boos, (12) 
D=Po + npit+-:::, and so on. (13) 


In Section III we list the terms in these expansions of order zero and 
one in 7, and in Section IV we outline their derivation. In this section 
we merely discuss some of the features of the solutions. 

We refer to solutions which in the limit as 7 — 0 reduce to even 
TE modes as “perturbed TE modes”; similarly, we refer to “perturbed 
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TM modes.” Expressions for the unperturbed TE and TM modes 
(7 = 0) are given in equations (25) through (32). 

Although the expressions we have obtained for the coefficients and 
parameters are quite complicated, several features of the perturbed 
modes stand out. If S,. ~ 0 or S23 ¥ 0, then the modes cannot be purely 
TE or TM. In general an important effect of the strain is to tilt the plane 
of polarization. This tilt is in general a function of x but not of z. Since 
the coefficients A,, ... are complex in general, the relative phase 
between H, in the perturbed TE mode and £, in the perturbed TM mode 
is a function of x. This relative phase at « = 0 cannot be determined 
unless the method of excitation is known, since all the A, cannot be 
determined, as is shown in Section IV. Ref. 4 considers the special case 
where Si. = S23 = 0, S,;3 + 0 and shows that the modes are rigorously 
TE or TM. That paper calculates only such parameters as 8 and p, not 
such coefficients as A, and B,. The parameters are expanded in two 
small quantities 6 and A describing the unstrained dielectric matrix. If 
we expand the expressions for the parameters in this paper to first order 
in the same small quantities 6 and A (to second order for 8) complete 
agreement is obtained with the Ref. 4 results. 

In the absence of strain, the surfaces of constant phase for both 
TE and TM modes are the planes 2 = constant. However, in the 
presence of strain, the surfaces of constant phase are no longer planes, 
and are different for the perturbed TE and TM modes.* 

Finally, Bo + 78:1 is real in all cases. Thus at least to first order in 
» the modes experience no exponential attenuation as they propagate. 

In order to get some feel for the magnitude of the effects involved, 
we consider several numerical examples. We first estimate the order 
of magnitude of » by relating it to observable phase differences. 
Consider a plane wave whose free space wavelength is A propagating 
over a distance J in a medium of index of refraction n + An. The phase 
difference Ay which this wave would experience over the same wave if 
the index of refraction were n is 


Ay = = (An). (14) 
If yn? is the photoelastic contribution to the dielectric constant, then 


An = (n? + yn’)? — n & Inn. (15) 
Therefore, we have 


n & \Ag/nlx. (16) 
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Now the upper limit of phase shifts observed’’ in GaP at \ = 6328 A 
over a length 1 = 0.6 mm is about 7/4. Taking” n = 3.31 this yields 
n & 0.8 X 10°*. This probably represents an extreme upper limit, and 
so we will assume 7 = 10°° in our examples. Recent X-ray measure- 
ments” of strain in P doped Si yield a value of 7 of about 107° when the 
concentration of the dope is Np ~ 10** atoms per cubic centimeter.” 

It can be shown that the matrix (7S ,g) is approximately related to the 
strain matrix (€4s) by the equations” 


3 
78 ap = —n' >» P epur€ur (17) 

By ve=l 
where 7 is the index of refraction and P q¢,, are the elasto-optical coeffi- 
cients. Crystals of class 43m have only three different elasto-optic 
coefficients when referred to the crystal axes. (See p. 251 of Ref. 10.) 
For GaP these are” 


Pi — —0Q.151, Pig = —0.074, Pig = —(0.082. (18) 


Since® n = 3.31 for GaP, and n~Sgg is at most of order one, it 
follows that the magnitude of the strain is roughly proportional to 7. 
In order to obtain the values of the elasto-optic coefficients in the 
coordinate system used in this paper, it is necessary to make a trans- 
formation of the elasto-optic tensor from its representation in the 
crystal axes. We will not do that here; rather we take (7Sag) as given. 
In Table I we define three possible strain contributions to the dielectric 
matrix, labelled a, b and c. Matrices a and b were chosen to demon- 
strate the effect of the off-diagonal elements S,. and Sos, respectively 
(Ref. 4 considered the effect of S,3; alone), while c was chosen to dem- 
onstrate a possible effect when all the off-diagonal elements are nonzero. 

For a GaP diode modulator we can write” 4 


K,=n (1 — 6,), a = 0, 1, 2, 3, (19) 
where 2 = 3.31 is the index of refraction of GaP”’ and the quantities 
5,.,a@ = 1, 2,3 are functions of the applied bias voltage V. In the original 


TABLE I—STRAIN ConTRIBUTION TO THE DIELECTRIC MAtTRIx* 


Type n Su S22 S33 Si S23 S13 
a 10-6 0 0 0 10 0 0 
b 10-6 0 0 0 0 10 0 
c 10-8 7.07 7.07 7.07 7.07 7.07 7.07 


* Components of the strain contribution, S;; , and the magnitude parameter 7. 
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symmetric step model 6) is independent of V. For E, || [111] Ref. 4 
showed that 


6; — — 26, do = 03 = 6, (20) 
where 
6=-— Eyran?/(3)}, (21) 


and where EB; is the (spatial) average junction field, 74; is the electro-optic 
coefficient, and 7 is the index of refraction. For a typical diode (diode 
KC46CA of Ref. 9), E,(measured in V/m) is related to the diode half 
width w(measured in m) and bias voltage V (measured in V) by° 


Ey = (2 — V)/(2w). (22) 
The half width can be determined by capacitance measurements® ° 
and related to the bias voltage by 


w(V) = 0.189 X 107°(1 — V/1.8)°2®. (23) 


Using the value rz, = —0.86 x 10°? m/V,* we can now calculate 
51, 62, and 83 as functions of V. 

For this diode, 8) = 1.612 x 10-*. However, it has been shown 
that the voltage dependence of the parameters of the symmetric step 
model is not correct, and the double walled waveguide much more 
closely describes the true voltage dependence.* ° We have not used 
the double walled model because it is analytically complex. Instead, 
since the modes in the single and double walled guides are very similar 
in form because they both decay exponentially as functions of x out- 
side the guide, we have used the single walled model but simulated 
the voltage dependence of the double walled model. This has been 
achieved by letting 8) vary with voltage. The voltage variation of So 
has been obtained by requiring the equality of expressions (2.33) and 
(3.18) in Ref. 4 for the decay constants p, and letting w, = w(0) and 
We = w(V). This yields the relation 


Sy) = (2.24 X 107”)/w. (24) 


In Table II we list these basic constants describing the unstrained 
diode as functions of V. Using these values, we can calculate from 
equations (33) through (37) the parameters of the unstrained TE 


* This is the unclamped value of ra given in Ref. 18. After these calculations 
were made it was determined that the clamped value ra = —0.97 * 10712 m/V 
should be used. However, since our results supply only qualitative information 
about actual diodes, we have not redone the numerical example. 
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TABLE [I-—CHARACTERISTICS FoR A TypicAL GaP D1opE* 


Bias 


voltage 
(Vv (10-§ cm) 10! 6: 104 62 104 63 104 do 
—2 1.87 —1.17 0.58 0.58 12.00 
—12 3.04 —2.51 1.25 1.25 7.38 
—24 3.85 —3.67 1.84 1.84 5.82 


* Given as functions of the applied reverse bias voltage V. The half width of the 
junction is w, and the components of the unstrained dielectric matrix are K; = 
n1 — 6;),7 = 0,1, 2, 38, where n = 3.31 is the index of refraction of GaP. 


and TM modes for ’ = 6328 A. These values are listed in Table IIT. 
Finally, in Tables IV and V we list the parameters of the correspond- 
ing perturbed TE and TM modes respectively. The accuracy of those 
terms less than 10 is uncertain in case c of Tables IV and V. In 
Figs. 2 through 7 we plot some of the components of the perturbed 
TE modes correct to first order in 7. In Figs. 2, 3, 5, 6, and 7 the 
imaginary part of the component is negligible and is neglected, while 
in Fig. 4 the real part is negligible with respect to the imaginary part 
and is neglected. In all cases the es component is negligible compared 
to the e; component. We have chosen the undetermined coefficients 
so that at x = 0, 2 = 0, ee in the perturbed TE mode and e, in the 
perturbed TM mode have zero phase to first order in 7. 

This example illustrates how much tilting of the plane of polariza- 
tion, or coupling of the TE and TM modes, is to be expected. The 
Si2 component produces the main effect, which from Figs. 2 and 3, 
is a maximum tilt of the plane of polarization of 3.5°. This effect 


TasLE II] — UnrErtrurRBED Mopr PARAMETERS* 





Type Bias 

of voltage 
mode (V) Bo Po to lo 
TE —2 3.308 0.0226 0.1096 0.1180T 
TE —12 3.309 0.0195 0.0796 0.1022+ 
TE —24 3.309 0.0160 0.0641 0.10077 
T™ —2 3.308 0.0259 0.1089F 0.1174 
TM —12 3.309 0.0307 0.0759F 0.0994 
TM. —24 3.309 0.0363 0.0552, 0.0953 


* Describing the unstrained TE modes, and the parameters Bo, po and ly de- 
scribing the unstrained TM modes as functions of the applied reverse bias voltage V. 
The wavelength of the light is 6328 A. 

} Derived parameters [y for the TE modes and fy for the TM modes. These derived 
parameters appear only in first and higher order corrections to the field. 
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Taste ITV—PARAMETERS FOR PERTURBED TE Mopers* 











Type Bias Pi n 
of voltage a ee es ee eens 
strain (VY) Ai Re(p.) Im(p1) Re(q) Im(q) h mt 
—2 10 221 —221 0 0 
a —12 |0 257 —257 0 0 
—24 |0 312 —312 0 0 
—2 10 1.51 —1.51)0 0 
b —12 #|0 1.51 —1.51/0 0 
—24 1/0 1.51 —1.51)0 0 














—2 |1.07; 156 | —0.3X1077| —156 | —2.14/2.13} —0.4X107° 
c —12 {1.07} 181 | +0.2x1077| —181 | —2.14/2.138; —0.4x1078 
—24 {1.07} 221 —0.2X1077 | —221 | —2.14|)2.13| —0.4X1078 














* For all perturbed TE modes fi = gq: = 0. 


decreases with increasing reverse bias voltage. However, it should be 
noticed from Figs. 4 and 5 that the coupling effect resulting from 
Ses increases with reverse bias voltage. The e,; component is roughly 
proportional to 7, so a doubling of the strain would double the mode 
coupling. Mathematically, the existence of this relatively large effect 
results from the largeness of the factor c given in equation (64) for 
perturbed TE modes and in equation (84) for perturbed TM modes. 
The TM modes exhibit a similar behavior. 

From Tables IV and V we see that the changes in the parameters, 
7P1, 791, 781, and so on, are indeed small, which gives us confidence 
that the perturbation treatment is reasonable. 


III. FORMULAS FOR THE SOLUTIONS 


To list the formulas for the coefficients and parameters, Aqg,... , 
p, ... (which appear in the expressions (7) through (11) for the 
solutions in terms of the various parameters describing the symmetric 
step model and the strain matrix), we begin by writing down the solu- 
tion for the strain free (7 = 0) case for both the even TE and TM 
modes. When » = 0, we have for the even TE modes 


éi(z) = e3(x) = 0, all x (25) 
€o(a) = cos (kfoz), |e|sw (26) 
= cos (kfypw) exp kp.(w — | x }), era ea) (27) 


while for the even TM modes 


TABLE V— PARAMETERS FOR THE PERTURBED TM Mopss 




















Type Bias pi qi 
of voltage 
strain (V) Bi Re(p1) Im(n1) Re(q) Im(q) nf 91 ly m1 
—2 0 193 —193 0 0 0 0 
a —j2 0 163 —163 0 0 0 0 
—24 0 138 —138 0 0 0 0 
—2 0 1.51 —1.51 0 0 0 0 
b —12 0 1.51 —1.51 0 0 0 0 
—24 0 1.51 —1.51 0 0 0 0 
—2 1.07 136 —0.2X107 —136 —2.13 —1.2X107 —1.2x*10- 2.13 —2.13 
Cc —12 1.07 115 —0.2xX1077 —115 —2.14 1.9X10- 1.9X107 2.13 —2.13 
—24 1.07 97.3 —0.3x1077 —97.3 —2.14 0.721074 0.721074 | 2.13 —2.13 
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Fig. 2— The relative amplitude of the real part of e: for the perturbed TE mode. 








e(x) = cos (kl,2), |2|sw (28) 

= = cos (klow) exp kpo(w — | x |), |x| >w (29) 

e.(xz) = 0, all x (80) 

e;(a) = iste sin (kl,2), [2 | sw (31) 

= z Pols cos (klpw) sgn (x) exp kp(w — |x|), |x] 2 w. (82) 
BoKo 


The parameters in these equations are given for the TE modes by 
the positive roots of the system of equations for po, Bo, and fo 


> = Bi ~ Ky, (33) 
fi = K.-— 6, (34) 
fo tan(kwfo) = Do (35) 


while for the TM modes the parameters are the positive roots of the 
system of equations for po, Bo, and lp, consisting of equation (83) and 


I, = Ks — (Ks/Ky) Bo, : (36) 
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Fig. 4— The relative amplitude of the imaginary part of e: for the perturbed 
TE mode. 
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Fig. 5 — The relative amplitude of the real part of e. for the perturbed TE mode. 


Kolo tan(kwl) = Kspo. (37) 


The expressions for h(x) can be obtained from equation (5). 

We now turn to listing the formulas for the coefficients and param- 
eters of the solutions for the perturbed TE and TM modes. For both 
the perturbed TE and TM modes we have the relations 


m= (a (Ge)me + 89-+ 5. 8a 
qi Do By 4K , [Ko(So2 + Sys) + Bo(Si S33) + 27 08oSi3] 


1 2 ; 
=e (Fe) {io S — 833) — Bo(Si1 — S33) — 22o8oS:s] 


+ 4K o[BoSi2 + époSes]’}*, (38) 
where p; corresponds to the “+” sign and q; to the “—” sign. It is 
also true, at least to first order in y, that 

Ca(—2x) = €_(x)*, a = I, 2, 3, (39) 


hence we only list those parameters determining the solution inz = —w. 

For the perturbed TE modes o, fo, and Bo are the positive solutions 
of the system of equations (83) through (85), and Jy is then given 
in terms of Bo as the positive root in equation (36). The remaining 
parameters are 


Qo = Po ’ | (40) 
Jo = fo, (41) 
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Fig. 6 — The relative amplitude of the real part of e: for the perturbed TE mode. 


tg = lo:4 (42) 
Bi = S22/(2Bo). (43) 


The quantities p,; and gq, are now determined by equations (38) and 
(43). Next we have 


I 


A=n= (S22 — 28081) /(2fo), (44) 
1.0 

STRAIN 

TYPE C 
0.8 4 Das 
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Fig. 7 — The relative amplitude of the real part of e. for the perturbed TE mode. 
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L 
mM, 


= 57 (Si(Ks — B) — 2K BoB, + Soa(Ks — 68) te 2Srala60l, 
140 
(45) 
where 1, corresponds to the +” sign and m, to the “—” sign, and in 
equations (44) and (45) £, is given in equation (48). Notice that 
from (48) and (44), f: = g, = 0 for the TE case. 
The expressions for the coefficients are 


AY” = ~Bo(BoSi2 + UPoSea) cos (kfow)/(2K opo(p, — q1)], (46) 
2 = —q cos (kfow)/(p. — q), (47) 
AS” = 4(Po/Bo) Ai”, (48) 
Bo = me Bae (49) 
2” = p, cos (kfyw)/(p. — u), (50) 
he a 2(Po/Bo)B:” , (51) 
FO =4G2 =0, «a =1,3 (52) 
PS = GO =3 (53) 
TL = Mo = 0, aS 12.3 (54) 
FY” 2 
- = 3c [+ SesBofo — Si(K, — fo] , (55) 
1 
FY = @: (56) 
yg 
5 = 3C [+ Si2Bofo = Sis(K, aa Bo) ] ’ (57) 


(1) 
3 


= 41, cos (kwfo) {aSo3(285(Ki — Kye — 1] 


(1) 
3 
bBo 2 
sk - S,2[2p.(Ko — Ks)e — 1]} , (58) 
A = (BoK3/l,K,) Ls”, (59) 
M;” = —(B.K3/lKi)M3”, (60) 


LL,” = M;? =0, (61) 
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where 
a = [Ky cos (kwh) + poKs sin (kwl)]’, (62) 
b = [1K sin (kwl,) — pol; cos (kwl,)|", (63) 
¢ = (KiK, ~ Kx — Kf)", (64) 


and ~,; and q; are the values appropriate to the TE modes. 
Finally, we write down the three combinations 


Ay” + BY? = —i(8o/po)[As” + B3”] 
— COs (kfow) [(po — Ko)Si2 + tBoPoSes)/(2Kopo), (65) 
As”? + B;” = cos (kfpw)[F3” + G3], (66) 


AS? + BY = [FL + GL] cos (kfow) + aS? — GS] sin (kfow) 
+ [L2 + M2] cos (klow) + i[L8? — MP] sin (klw). (67) 


The coefficients FS = G§, and hence Af? + B{”, are arbitrary and 
correspond to an overall multiplicative constant. They can be set equal 
to zero with no loss in generality. We discuss this point further in 
Section III. Moreover, the individual coefficients A{”, Aj’, Bf”, and 
Bf cannot be determined at this stage. However, the terms we have are 
sufficient to determine each component of the field up through order 
one in 7. 

For the perturbed TM modes po, lo, and Bo are the positive solu- 
tions of the system of equations (33), (36), and (87), and fp is given 
in terms of By as the positive root in equation (34). The parameters 
go and pp are still related by equation (40), m and ly by equation 
(42), and go and fy by equation (41). The remaining parameters are 


B, = [1/(28o) [Ko Kilo + Kp) + ‘ia 
° {(S1185/Ki][KoK spo “1 Kils Ft 
+b [Ki Sssl5/ K3] [K3(K; — Ko) po TG, }} , (68) 
where 


f= (kpow)(Kspo + Kolo). (69) 


With the aid of (68) and (69) for 8:, p; and q; are determined by 
equation (38), f; and g, by equation (44), and 1, and m, by equation 
(45). 


WAVEGUIDE STRAIN 1949 


The expressions for the coefficients are 


Ba = —K, cos (kwly) (2808: — 2p: — Soo)/[2poKo(pr = qi), (70) 

2” = —K, cos (kwh) (BoS12 + tPoS23)/[2po8o.Ko(p: — a), (71) 
BY = K, cos (kwly)(28o8: — 2p091 — So2)/[2p.Ko(p: — q)), (72) 
Be = —A9 3 


where f, is given by equations (68) and (69), p: and q, are the values 
appropriate to the TM modes, and A$” is related to A{” by equation 
(48) and Bi to B® by equation (51). Furthermore, 


BL ag 4, a=1,2,3 (74) 
L? = MP = 4, (75) 
= MP = 0, (76) 
eS LK) Bos), (77) 
Pei a eG a6. (78) 
Pe 


cos (kwlo) {BoaS12[Ki + 2cpo(Ko — K;)] 


~ 
as DobSo3[Ky + QWcBo(K, = Ko))}/ApoboKo), (79) 


i” 
MS”? 
AM 4 BO = [FO + GL?) cos (kwh) + iP? — G2] sin (kwf,) 

+ (LS? + M3] cos (kwl)) + i[Z3” — Mz”) sin (kwl,), (81) 


= —4e[Si. & Ki S03/(8.Ks)] (80) 


where 
a = [po cos (kwfo) — fo sin (kwf.)]’, (82) 
b = [fo cos (kwfo) + po sin (kwf.)}", (83) 
c = (K, — Ij — Bo)’. (84) 


Just as in the perturbed TE case, the coefficients cannot all be deter- 
mined uniquely. We can with no loss of generality set 


Ly SS” 0, (85) 
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Once this choice is made, we have 


® = [Sis — IB: — Bo + (Sea — 2lol:)(loK1/BoKs)1/(2Bolo), (86) 
My” = [—Sis — eB: — mBo + (Sss — Qlom)(loK1/BoKs)]/(2Bolo), 
(87) 
s+ Sy) = {—(Sispokw)/Ko 
+ i[3kw(K2608u/K1 + K,UGS833/K; — 2K 38081) 1/(KsBo)} cos (kwlo) 
(88) 
Ay? -+ BY? = —%i(8o/po)[As” + Bs] + (Ki cos (kwl))/KoBopo) 
*[—KoB:/po — BoSu(po — Ko)/(2poKo) 
+ poBoSsa/(2Ko) — tpo813/Kol. (89) 


A knowledge of these terms is sufficient to determine each component 
of the field up through order one in 7. 


IV. DETAILS OF THE CALCULATIONS 


In standard fashion H can be eliminated from equations (5) and (6) 
by taking the curl of equation (5) and by making use of the assumed form 
of the solutions, equations (7) and (8). There results the system of 
equations 








3 
18 a > Be, + DS Kyla = 0, (90) 
des _ 
dé” Cg = 0, (91) 
fe St pee + > Ka gen = 0, (92) 
dé dé 
where we have introduced the new independent variable 
E = ke. 98) 


The standard boundary conditions” on E and H yield the conditions 
that €2, €s, deo/dé, and de;/dé +- i8e, must be continuous at § = o = kw. 

The general plan of the calculation is first to consider the equations 
obtained by substituting into equations (90) through (92) the expres- 
sions for eg in the various regions given by equations (9) through 
(11). From these equations, one can determine up through first order 
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in y all but one of the parameters and some of the coefficients as 
functions of the parameter 8,. Upon substituting these values into 
the boundary condition equations, a set of equations is obtained from 
which 8; and some of the remaining coefficients can be determined. 

Since the components of the electromagnetic field satisfy a linear, 
homogeneous system of equations, it follows that if eg(v), «@ = 1, 2,3 
is a solution set, then so is (1 + ayy + don? +...) @a(%), e = 1, 2, 3, 
where the constants a;, a2, ... are arbitrary. For example, if the 
coefficients Aq, Ba, ... given by expansions of the form (12) rep- 
resents a solution, then the coefficients given by expansions of the form 


A, = AD + 1aAP + ADI +->:, (94) 
with the same a, used in each expansion, represent another solution. 
Thus unless the corresponding zeroth order coefficient is zero, the first 
order coefficient cannot be uniquely determined. We do, however, 
have the arbitrary constant a; at our disposal. The multiplicative 
constant (1 + din + ...) can only be determined from a knowledge 
of the excitation of the mode. 

If the assumed expressions for e, in § 2 o given by equation (9) are 
substituted into equations (90) through (92) we get the set of homoge- 
neous, linear equations in A,, a = 1, 2, 3, 


(Ko + Su — B’)Ar + Si24e + (nSis — iBp)As = 0, (95) 
nSi2Ay + (Ko + 1Se2 + p = B)A, + 9So3A3 = 0, (96) 
(nSi3 — iBp) A, + S234. + (Ko + 7S33 + p)As; = 0, (97) 


plus a similar set of equations with A, replaced by B, and 7 replaced 
by q. The condition that these equations have a nontrivial solution, 
the vanishing of the determinant of coefficients, yields a relation be- 
tween £ and p of the form 


D(p, 8) = 9, (98) 


where D(p, 8) is a quartic polynomial in p and f. The second set of 
equations involving the B, and q yields the same determinantal equa- 
tion with p replaced by q, 


D(a, B) = 0. (99) 


That is, g is a second root of the quartic. If p, q, and B are expanded 
in powers of 7 as in equation (13), equations (98) and (99) can be 
expanded in powers of 7 and the coefficients of the various powers of 
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n can be equated to zero. The vanishing of the lowest order term 
yields equation (33), which is satisfied by both po and qo, thus also 
yielding equation (40). The vanishing of the first order coefficients 
shows that p, and q; are the two roots of a quadratic which are 
given by equation (38). These results are independent of the TE or 
TM character of the mode. 

Equations (95) through (97) can now be expanded in powers of 7 
by substituting in the expansions of p, 8, and A,. The three similar 
equations involving q, B, and B, can be expanded in powers of y in 
the same way. Because pp and {op satisfy equation (33), equation (96) 
vanishes to zeroth order in y, while equations (95) and (97) yield 


(Ky — 8o)Ai” — iBop.As” = 0, (100) 
—1Bop.Ay” + (Ko + p)AS” = 0. (101) 


The determinant of this pair of homogeneous equations vanishes because 
equation (33) is satisfied, so a nontrivial solution exists. The quantities 
B{® and BS satisfy the same equations. Using equation (33), it follows 
from equation (101) that A{® and A{® are related by equation (48), and 
B{® and B® are related by equation (51). 

To first order in y, equations (95) through (97) are 


(Ko — Bo)Ar”? — *Bop.A5” 
= —[(Si1 — 288:)Ar” + Si242” + (Sis — tpoBi — tp:8o) As], 
(102) 
Si2Ar? + (2popr1 — 268i + S22) AS” + S23A5” = 0, (103) 
—iBypoAy? -+ (Ko + po) As” 
= —[(Sis — tpob: — ip:8o) At? + SosAS° + (Ss + 2pop.) As” ]. 


(104) 


With the replacement of A, by B, and p by q in equations (102) 
through (104) we obtain the first order equations satisfied by the By. 
At this stage we must differentiate between the perturbed TE and 
TM modes. For the perturbed TE modes we must have 


1” + BI? = —i(Bo/po){As” + Bs] = 0, (105) 
A” + B” = cos (kfw), (106) 
while for the perturbed TM modes 


I 


AY? + BO = ~i(Bo/p.)[AP + BY] = (Ki/K,) cos (blow), (107) 
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As” + B” = 0. (108) 


If we now add to equation (103) the equivalent equation in B,, and 
make use of the fact that A“? + B“?, a = 1, 2, 3 are prescribed for the 
perturbed TE modes, in equations (105) and (106), we get a new equation 
involving only A$ and BS”. This equation together with equation (106) 
can be solved for A{” and BS” to yield (47) and (50). Once A$” is known, 
A{® and A{®” can be determined from equations (103) and (48) yielding 
(46). We get B® and BS° from equation (105). In the same fashion we 
determine A®” and B®’, a = 1, 2, 3 for the perturbed TM modes. 

Equations (102) and (104) [and the two equivalent equations in B{” 
and B$”] are two inhomogeneous equations whose determinant vanishes. 
Thus the left side of (102) is a multiple of the left side of (104), and the 
equations are compatible only if the right side of (102) is the same 
multiple of the right side of (104). This can be shown to be the case, and 
so (102) and (104) provide just one relationship between A{” and A;”. 
There is a corresponding relationship between B{” and B$”. 

By replacing A,, B., p, q by C., Dz, —7, —8, respectively, in the 
equations so far obtained, the formulas for the region § S —co are ob- 
tained. Here —r and —s are the remaining two roots of the quartic 
D (p, B) = 0. , 

Next, if the assumed expressions for e, in |é| < o given by equation 
(11) are substituted into equations (90) through (92) we get four sets 
of three homogeneous, linear equations in F, ,G, , L, , and M, , 
respectively, which hold for both the perturbed TE and TM modes. 
These equations are obtained from equations (95) through (97) by 
replacing A, and p by F, and —7/,G, and ig, L, and —zl, and M, and 
um, respectively. 

The determinental equation for each of these four sets of homo- 
geneous equations can again be expanded in powers of 7, and the coef- 
ficient of each power of » separately equated to zero. The vanishing 
of the zeroth order coefficients yields equations (34), (41), (86), and 
(42) relating fo, go, lo, and mp to Bo. The vanishing of the first order 
coefficients yields equations (44) and (45) relating fi, gi, 1,, and m 
to Bi. 

Each of the four sets of homogeneous equations can be expanded in 
powers of 7, just as for the equations describing the region & = o. To 
proceed further, we must again differentiate between the perturbed 
TE and TM modes. For the perturbed TE modes, equations (52) 
through (54) must be satisfied, while for the perturbed TM modes, 
equations (74) through (77) must be satisfied. These values satisfy the 
lowest order equations identically. 
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For both the perturbed TE and TM mbdes: the first order equations 
can be written as 
(K, — Bo)Ft? — BofoFs” = —SrF2”, (109) 
—BofoF!” + (Ks — fos? = —SosF3”, (110) 
(Ky — fo — Bo)F2” 
= —S2F — SF — (See — ofr — 2808) FS. (111) 


For both the perturbed TE and TM modes, equation (111) vanishes and 
yields no information, while equations (109) and (110) have a nonzero 
determinant and so can be solved for F{” and F{” yielding the solutions 
given in (55), (57), and (78). If we replace , and f by G, and —g, we 
get equations which can be solved for G{” and G{” yielding solutions 
given in (55), (57), and (78). The equations obtained when F’, and f are 
replaced by L, and l, and M, and —™m, respectively, have a different 
character. The two equations in L$” and M$” corresponding to (111) do 
not vanish identically and can be solved for L$” and M$. The solutions 
are given in (61) and (80). The equations in L{” and Li”, and M{? and 
M%, have a vanishing determinant. In the perturbed TE case, the 
equations are homogeneous and yield (59) and (60). In the perturbed 
TM case, the equations are nonhomogeneous but compatible, and yield 
the relations 


—BoloLi” + (Ko/K,)L3” 

= —4(S1s — lob: — bBo) — 3(Sss — Qbh)(loKi/BoKs), (112) 
BolbMx” + (Ks80/K1) M3” 

= —2(Sis + lB + miBo) + 3(Ss2 — 2lomms)(oK/BoKs). (118) 


We finally turn to the boundary conditions at £ = -to, of which there 
are eight, four at each boundary. They can be grouped as follows 


A, + B, = Fre’ + Gre" + Lie” + Moe *”’, (114) 

C, + D, = Foe” + Gie” + Lie” + Mie”, (115) 
—pA, — qB, = ifF.e” — igGre'”” + iL," — imM.€'"", (116) 
rCz + sD, = ifF.e* — igG.e’’ + il.’ — imM.e”’, (117) 
A; + B, = Fre’ + Gre *”” + Le’ + Mae *”’, (118) 

Cz + D; = Fie” + Gye” + L,e*” + Mie", (119) 
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—pA; qB; + 1B(A, + B,) = i(fF’ ++ BF )e”” ++ 1(—gG's + BG,)e*”” 
+ i(lL; + BL,e’ + i(—mM, + BMe*”’, (120) 
rC,; + sD; + i@(C, + D,) = fF; + BFDe”’ + i(—gG, + BG,)e’”’ 


+ i(1Ls + BLE" + i(—mM, + BMy)e"". (121) 
These equations split naturally into two groups, one group involving 
only the subscript 2 and the other group involving only the subscripts 
1 and 3. These equations can be expanded in powers of 7. The zeroth 
order equations are satisfied as long as (35) holds in the perturbed 
TE case and (37) holds in the perturbed TM case. 

For the perturbed TE modes, the first order expansion of equations 
(114) through (117) yields four nonhomogeneous equations in A$” + 
BY, CS? + DY, FS and G{?. The inhomogeneous terms on the right 
side of these equations contain the parameter §,. The determinant of the 
equations vanishes, and then the condition that they be compatible 
provides an equation from which §,, given in (43), is determined. Once A, 
is determined, these equations yield (56) and (66). We can now choose 
the arbitrary parameter a,—indicated in (94)—so that F{? = 0. Then 
from (56) and (66) G§? = Af? + B& = 0. In addition, since (; is real, 
it can now be shown that 7; = p*, s, = g%, C® = AM*, DO = BO*, 
and CS)’ + D® = [AM + BY]*, a = 1, 2, 3, which justifies equation 
(89). Finally, the first order expansion of equations (118) through (121) 
can be combined with equations (59) and (60), equation (102), and the 
corresponding three equations in B{” and Bj”, C{? and C$”, and D{” 
and D to form a set of equations from which AY + BY = [Cc 
+ D®)*, and M“, a = 1, 3, can be determined. These are listed in 
(58), (65), and (67). 

For the perturbed TM modes the procedure is virtually the same, 
except that it is now the first order expansion of equations (118) 
through (121) which has a vanishing determinant. The condition 
that these be compatible then yields the expressions (68) and (69) 
for B,. This set of equations also yields the result that 

LY + MY = 0. (122) 
We can now pick the arbitrary parameter a, so that L{” = 0, which 
combined with (122) yields (85). Equations (112) and (113) now yield 
(86) and (87). The first order term of equation (118) then yields (88), and 
this result, combined with the equation obtained by adding equation 
(102) to the corresponding equation in B yields (89). Finally, equations 
(114) through (117) yield expressions (79) through (84) for F§’, G&, 
Ly”, M3”, As” + By”. 
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A State Variable Method of Circuit Analysis 
Based on a Nodal Approach 


By R. E. PARKIN 


(Manuscript received April 12, 1968) 


A method which is well suited for implementation on a digital computer 
is presented for the solutions of active circuits. Unlike many state variable 
approaches the state vector is defined as the set of voltages which exist between 
certain nodes and the reference node. An advantage of this approach is that 
degeneration in the order of complexity of the network caused by capacitance 
loops is handled automatically. Any type of controlled source can be spectfied. 
From the basic algorithm the circuit is specified in matrix form by inspection 
using standard nodal methods, and the solution 1s obtained by a systematic 
reduction of this one matrix equation. An upper bound on the order of 
complexity of the network 2s evident from the network topology or the 
partitioned form of the original matrix. Inductors are included in this 
approach by considering the equivalent gyrator-capacitor combination. 


I. INTRODUCTION 


State variable techniques presently being used to analyze networks 
require a detailed knowledge of graph theory.’ Another method of 
state variable analysis that is based partly on a nodal approach and 
does not require a detailed knowledge of graph theory is very re- 
strictive. The method presented here performs a nodal analysis on a 
transformation of the network in which all magnetic storage elements 
have been replaced by gyrator-capacitor equivalents, and nothing 
more than a basic knowledge of graph theory nomenclature is re- 
quired. The rctmst* network can be transformed to an equivalent 


* Resistor, capacitor, inductor, mutual inductor, source and ideal transformer. 


1957 
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Fig. 1— Inductor and transformer equivalents. 


resistance capacitance source network using the gyrator-capacitor 
equivalents shown in Fig. 1. Each gyrator shown in Fig. 1 has the 
indefinite admittance parameters 


ee 0 2 SUV, 
Is = —I 0 1 Ve ; 
Te 0 -1 OJLV~ 


choosing this type of gyrator enables the capacitor value in farads of 
the equivalent pair to be equal to the inductor value in henries. 

Let the number of nodes of a transformed network be n. Using 
Kirchoff’s current law, it can be shown that for an n-node RCS net- 
work 


CV =I- GV (1) 


where I is an (n — 1)th ordered column vector representing the currents 
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injected into the nodes, V is an (n — 1)th ordered column vector repre- 
senting the voltages between the nodes of the network and the reference 
node. If the transformed network contains | capacitors then the matrix 
C is an (n — 1)th ordered symmetric matrix which contains imbedded 
within it 1 second order indefinite matrices, each having the dimensions 
of farads. Similarly G represents the resistors and has the dimensions of 
mhos, but G may be asymmetric. Node n is the common or ground node 
of the network; for convenience this node is always assumed to have 
capacitors connected to it. 

The objective is to find an upper bound on the rank of the ca- 
pacitance matrix C by partitioning C as described in Section II, and 
reducing the matrix equation (1) containing the partitioned matrix 
C to the rank of C; this reduction is symbolic and does not take into 
account degenerate cases which can occur. It is shown in Appendix B 
that for all conditions, for any type of circuit, an upper bound on the 
order of complexity of the network (rank of C) can be found from 
the network topology. 


II. PARTITIONING OF THE CAPACITANCE MATRIX 


There are basically four types of voltage source (vs), the independ- 
ent vs (1vs), the voltage dependent vs (vpvs), the current dependent 
vs where the current is through a resistor (cpvsr), and the current 
dependent vs where the current is through a capacitor (cpvsc). It will 
be shown that the only current source (cs) which can effect the parti- 
tioning is the current dependent cs where the current is through a 
capacitor (cpcsc). As a result, any type of cs will be termed simply 
a CS, unless it is a CDCSC. 

The method of partitioning makes the reduction of the matrix 
equation (1) to its rank a simple process. Generally only the voltage 
at a node to which a capacitor is connected can be a state variable 
node. However it is possible to choose a node to which a cpcsc or 
cpvsc is connected as a state variable node instead of one of the 
nodes of the capacitor whose current supplies the dependence, but this 
possibility is avoided automatically in the partitioning method pre- 
sented here. 

The presence of inductors and time-invariant, independent cs’s 
forming a cut-set in the original untransformed network causes a 
linear dependence problem in the transformed network. In the trans- 
formed network such a cut-set appears as a capacitor tree with gyra- 
tors only connected to the end nodes of the tree as shown in Fig. 1, 
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and gyrators and perhaps time-invariant independent c.s.’s connected 
to the central node (the Gcnope) ; the nodes of this capacitor tree will 
be called the acser nodes. 

The capacitors in the transformed network can be divided into two 
classes, those connected to the reference node directly or through a 
vs-capacitor chain (the fixed capacitors), and those not so connected 
(the floating capacitors). The m floating capacitor subgraphs are 
defined as the m unconnected subgraphs obtained from the floating 
capacitor plus imbedded vs graph of the transformed network. 

The partitioning of the capacitance matrix will be related to the 
example of Appendix A in the discussion that follows. Partition the 
matrix C as 


nl n2 n3 n4 n5 
nl {Cu Ci Cis O Cis 
n21Cx Co Co O 
n3 }C3, Cs. C33, 0 
n4};0 0 0 0 0 
n5 LCs, Cs. Css 0 
where 


(t) The nodes nl are all the nodes to which capacitors are connected 
omitting the following nodes: 


(a) A node for each vs imbedded in a capacitor chain (these nodes are 
in the n2 section), but each capacitor must be specified by at least one 
node. 


(b) A node for each of the m floating capacitor subgraphs (these nodes 
being in the n3 section). 


(c) A node for each a@csEeT which is specified in section n2. 

In the example in Appendix A, nl contains nodes 1 — 9. 
(zt) The nodes n2 represent: 

(a) A node for each pvs imbedded in a capacitor chain. 


(b) A node free of capacitors for each cpvsc and cpcsc free of capaci- 
tors on at least one node. 
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(c) A node for each GcSET. 
In the Appendix A example n2 contains nodes 10 and 11. 


(zit) Section n3 contains a node for each of the m floating capacitor 
subgraphs. 
In the example n3 contains nodes 12 and 13. 


(iv) Section n4 contains the nodes to which only resistors and cs’s (but 
not cpcsc) are connected, including a node for each Ivs, vpvVS or cpcsc 
free of capacitors, other cpvsc’s or cpcsc’s on both nodes. (The other 
nodes of these sources are specified in section n5). 

In the example n4 has no entries. 


(v) Section n5 contains all the remaining nodes. These are: 
(a) A node for each tvs. 


(b) A node free of capacitors for each vpvs or cpvsr free of capacitors 
or CDVSC or cpcsc on at least one node. 

In the example n5 contains node 14. 
The rank of the C matrix is nl, and nl = 9 for the example of Appendix A. 
Notice that the presence of capacitance loops in no way alters the method 
of partitioning. 


III. REDUCTION OF THE CIRCUIT DESCRIPTION TO A MINIMAL FORM 


Theorem: An upper bound on the order of complexity of a network ts the 
order of nl. 


This theorem is proved in Appendix B, where it is shown that every 
row in sections n2, n3, n4, and n5 is linearly dependent on rows in section 
nl; the subspace spanned by sections n2, n3, n4, and n5 is contained innl. 

The systematic reduction of equation (1) is accomplished by first 
eliminating section n5 by applying the voltage restrictions caused by the 
vs’s in section n5. Secondly, section n4 is eliminated using the fact that 
these nodes are free of capacitors. Next, section n3 is eliminated to 
correct the over specification of the floating capacitor subgraphs. 
Finally, the remaining dependencies of the system are caused by the 
pvs’s imbedded in capacitive chains, the cpvsc and cpcsc free of capaci- 
tors on at least one node, and a node for each capacitive tree in which a 
GCSET has occurred; these dependencies are eliminated with section n2, 
yielding equation (9) of Appendix B. 

Equation (9) of Appendix B can be written as 
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vl = B — Avil (2) 
where 


vl 
v2 
v=|v3 
v4 
v5 


and vl is the voltage vector for nodes nl, n2, n3, etc. 


An example of a solution based on the problem set by Pottle is 
given in Appendix C. 


IV. CONCLUDING REMARKS 


A state variable technique has been described that offers two ad- 
vantages over traditional methods: 

(1) The network can be specified completely by inspection using 
well known nodal techniques with little skill required, the problem 
then becoming one of simple matrix reduction (easily programmed 
for a digital computer). 

(ii) Capacitor loops present no problem and are not even recognized 
as such since the partitioning and matrix reduction are unaltered if 
there are any capacitor loops present. 

The main disadvantages are that currents must always be ex- 
pressed as functions of node voltages and inductors must be replaced 
by gyrators and capacitors; inductor cut-sets must be recognized and 
the circuit redrawn before inductors are eliminated so that the cut- 
set encircles one node only, and this is sometimes inconvenient. 


APPENDIX A 


Example of Partitioning 


For the example of Fig. 2(a), the transformed circuit without in- 
ductors is given in Fig. 2(b). (This is a theoretical problem and the 
circuit has no practical value.) This circuit is described by the equa- 
tions 
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kVp wlcy 





(b) 


Fig. 2— Circuit to demonstrate transformation and partitioning. 
where J; and I. are the unbalance currents due to the vs’s and 


V4 Vs . 


Notice that except for degenerate cases (for example, if Cg = 0), the 
order of complexity of this network is 9. 


APPENDIX B 


Matrix Reduction 


Consider the partitioned form of equation (1). The section n5 can be 
eliminated as follows: for an tvs of @ volts connected between nodes k 
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and I (node k is in section n5, node 1 is not) 
Uv, = U1 + a. 


For a vpvs or cpvsr connected between nodes k and 1, where the vs is 
dependent on the voltage vector v,, (each voltage of v,, is not in section 
nd). 


Vy, = 0 + BVn. 
Thus the system can be reduced to 


nl n2 n3 n4 


nl |Ci Ci. Cis 0 |] v1 il Gir Gio Gig Gia || VI 
m2 jC Co. Cos O |) 2] _ | i2| | Gor Goo Gos Goa || v2 (3) 
n3 {Csi Cz. C33 O |} ¥3 i3 Gsi  Gs2 G3 Gaza | | V3 
maLO OO O 0 E i4 | a Gan Gaz Gaa_l V4 


Nodes n4 can be eliminated by first writing part of equation (8) as 
vl 
v4 = Gas) 14 — [Gar Gaz Gus] | v2 (4) 
| v3 | 


Cir Cie Cis |] V1 il Gir Gio Gis || v1 G4 
Coy Cor Cog |} V2) = | i2| — | Gor Goo Gos} | v2 | — | Gog |v4 
C31 Cs. Css} LW3 i3 Gai Gyo Gss_| LV3 Gs 
itl Gli, Gliz Gis || vl 
=/it2}—|Gl.. Glo GI1.; || v2 |. 
it3 Gls: Gls. G1s3_1 Lv3 


The matrix 
Cu Cw Cis 
Cor Coz Cos 
Cs Cse Co 


has order nl + n2 -++ n3 and rank no greater than nl + n2. The n3 
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linearly dependent rows and columns can be deleted from equation (5) 
by adding selected rows in the section nl and n2 to rows in the range n3. 
The selection is made as follows: starting with any row in the section n3, 
examine the first entry. If it is nonzero add row 1 to this row. Continue 
along the row, repeating if necessary, until all the entries are zero. 
Proceed for the other dependent rows. Equation (5) can then be written 
as 


Cag Cs Ca v1 itl G1,, G1, G1,3 vl 
vw2/= - v2 (6) 
ope, Cos Cos v3 it2 Glo, Glo». G1 23 v3 


and 


v3 = G1iz i — [G1}, G132] al (7) 
v2 


It is a simple process for the reader to prove to himself that elimi- 
nating a node of a floating capacitor subgraph which is part of a 
GCSET as described above yields the same result as equating the alge- 
braic sum of the voltages across the capacitors in the GcsET to zero 
(analogous to the algebraic sum of the currents entering the inductor 
cut-set node through the inductors adding up to zero). 

Substituting equation (7) and its derivative into equation (6) we 


obtain 
i =| al _ a = as sa Mi (8) 
C20, C22} LW2 ip2 G21 G25 | v2 
The total number of restrictions have not yet been placed on the 


network. 


(<) For a pvs imbedded in a capacitor chain or a cpvsc free of capaci- 
tors on one node connected between nodes k and 1, where node k is 
specified in section n2, 


y= + YW; 
or 
vy, = V1 + vv; 


where v; is the set of voltages upon which the source is dependent. A 
particular voltage of v; may be in any section nl, n2, n3, n4, or n5. 
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(%) For a cpcsc connected between nodes k and lI, the currents 
I, and I, injected into nodes k and l with the cpcsc removed must be 
modified to 

i,t nV; 
and 
I, - Vj, 


respectively, where 7 has the dimensions of farads. 

(iz) For a acsreT with node j of the capacitor tree containing the 
GCSET specified in section n2, node j is eliminated as follows: examine the 
entries of row j of the remaining capacitance matrix. If entry C;,, # 0, 
subtract C;,:/C,,, times row l from row j, where! = 1, p; p is the order of 


nl + n2. Thus row jis reduced to a row of zeros. 
The system can now be written as 


[C]v1 = iF1 — [G]v1. (9) 


Barring degeneracy, matrix C is nonsingular with rank n1. 


APPENDIX C 


Example of the Method 


For the circuit of Fig. 3 (the example of C. Pottle’), nodes 1, 2, and 3 
are placed in the nl section, and node 4 is placed in the n5 section. Thus, 
by inspection 


C; + Cy 0 0 —C, bh eee =a 01) — 2G2(v2 = 01) 


0 CC, 0 O |fiz}_ 0 
0 0 Cs; O Vg 0 
—C, 0 O Cy JL%, I, | 
G+G@, ~~, —G, 0 |fn 
| -G@ GAtG 0 —-G@|\» 
—Gs 0 Gz;—-G, 0 V3 
0 —G;, 0 Gi »,_| 
where 


y= LH; iy = &. 
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Ga 
2Ca4(Vg-V) + 
2G4(Va-V;) 


Tig. 3— Example of Appendix C. 


The derivative of the source H must be considered if a capacitor is 
connected to both of its nodes. Clearing out the voltage terms in the 
current array, 


Ci a Cy 0 0 C; On 





0 0 C, O |/ds 
—C, 0 O Cyt Lay 
0] |G,-G  G, —G; 0 |Ia 
wi OP yc. | GE IGG. 0 —G, || 0» 
0 —G; 0 G,;,+G, 0 U3 
Ty 0 —G; 0 G; ty 
Eliminating v, and #, as described in Appendix B, 
Ci—C, 2C, 0 |fo —C,E) 1G;-G, — G, —G; lly 
0 C, Offa /=| GL j-| -G G+aG, 0 Uy |. 
0 0 CstLos 0 —Gs 0 G3 + Gs|Lvs 


This is as far as we can go symbolically and as far as the method 
takes us. Normally all that remains is a simple inversion of the re- 
maining capacitance matrix, but Pottle chose C,; = C,. This makes 
the capacitance matrix singular and so another node must be elimi- 
nated. Eliminating node 1, 


2GiGs C1G, 


Gtgetg Oa} \r+grgs 
2C,,G, ° CLG; ad 
=C: Cs | | 03 Ge. E 
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G __ GAGs 
= G; + Ge + Gs af? G. Gs oon Gs V2 
GG __«& 
G, _ G, G3 + Gr G, _ G. V3 


The vector 


Ln 
Us 
can now be expressed explicitly. 
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Uniform Asymptotic Expansions for Saddle 
Point Integrals—Application to a 
Probability Distribution Occurring 

in Noise Theory 


By STEPHEN O. RICE 
(Manuscript received June 5, 1968) 


The noncentral chi-square distribution occurs in noise tnterference 
problems. When the number of degrees of freedom becomes large, the middle 
portion of the distribution ts given by the central limit theorem, and the tails 
by a classical saddle point expansion. Here recent work by N. Bleistein and 
F. Ursell on “uniform” asymptotic expansions 1s combined and extended 
to obtain an asymptotic series which apparently holds over the entire range 
of the distribution. General methods for expanding saddle point integrals in 
uniform asymptotic series are discussed. Recurrence relations are given for 
the coefficients in two typical cases, (1) when there are two saddle points and 
(iz) when there ts only one saddle point but it lies near a pole or a branch 
point. 


I, INTRODUCTION 


This paper deals with the problem of obtaining asymptotic series 
for the complex integral 


Tm | Pg(i) exp [ah()] dt (1) 


when x becomes large. Problems of this sort are quite often encoun- 
tered in applied mathematics, particularly in wave propagation. The 
material presented here grew out of some recent work by G. H. Rob- 
ertson' on the “Marcum Q-Function.” This function, which appears 
in the study of radar interference, gives the distribution of the random 
variable (noncentral x”) 


z= (I/2) Dat. (2) 


1971 
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Here x is a positive integer and y;, Yo,..-, Yo are independent gaus- 
sian random variables with unit variances and mean values which 
may be different. 

Mr. Robertson has devised an algorithm for computing the Q-func- 
tion which may be used for a wide range of the parameters appearing 
in the function (that is, in the noncentral ,? distribution). In an 
earlier paper on information theory, by working with an integral of 
the type in equation (1), I had obtained an asymptotic (for large 
x) expression for the tails of the distribution.2 However, comparison 
with results obtained by Robertson showed that my expression failed 
badly in the central part of the distribution where the central limit 
theorem holds. 

The need for an asymptotic expansion which holds uniformly over 
the entire range of the distribution led to a study of the recent work 
on “uniform” asymptotic expansions of integrals. The first part of 
this paper is an exposition, plus extensions and generalizations, of 
some of the procedures which have been used to obtain uniform 
asymptotic expansions of integrals of the type in equation (1). The 
theory is then applied to the noncentral ,? distribution. 

Two procedures are considered. For convenience, we call them the 
“Bleistein method’* and the “Ursell method.”* Although these names 
are among the best that suggest themselves, they are not entirely 
satisfactory because they contain no hint of the earlier work by 
others, especially Olver, Chester, Friedman, and Ursell.>*° Here we 
have recast the underlying ideas used by Bleistein and Ursell into 
forms better suited to our purpose. 

Both methods lead to the same asymptotic series. The Bleistein 
method gives a compact expression for the coefficients in the expan- 
sion. However, from the few examples that have been studied, it 
appears that the labor required to reduce this compact expression to 
a computable form is at least as great as that required by the Ursell 
method. 

Section III and Appendices A, B, and C are concerned with a prelimi- 
nary change of variable in the integral J. The case, denoted by “‘A = 1” 
for brevity, in which the exponent ) is a positive integer, is discussed in 
Sections IV, V and VI. This material is applied to the problem of two 
saddle points in Appendix E. The case in which ) is general, denoted 
briefly by “A + 1,” is discussed in Sections VII, VIII, and IX, and in the 
examples in Appendices I’, G, and H. The results of Section IX are ap- 
plied in section X to obtain the desired type of expansion for the non- 
central x’ distribution. Useful results regarding classical saddle point 
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expansions are stated in Appendix D. Some of the results given in 
Appendix F for the general case of a saddle point near a branch point are 
applied in Appendix G to obtain an asymptotic series for the Poisson- 
Charlier polynomial, a polynomial of interest in traffic theory. 


I. STATEMENT OF PROBLEM 


The general problem is to obtain an asymptotic series for the inte- 
gral J defined by equation (1) when x becomes large and most of the 
contribution to J arises from a (rather loosely defined) “critical re- 
gion” around ¢ = 0. The path of integration L’ is supposed to start 
and end at |t]| = o in “valleys” in the complex ¢-plane where 
exp[xh(t)] > 0 as |t| > oo. Let the starting and ending valleys be 
denoted by S and #, respectively. The path L’ starts in S, climbs up 
to and passes through the critical region, and then descends down 
into EL. 

The functions h(t) and g(t) are analytic in the critical region; 
and one or more saddle points, that is, points where h’(t) = dh(t)/dt 
vanishes, lie in the critical region. We assume (0) = 0 and that z is 
real and positive. If « were complex, the factor exp(z arg x) could Be 
included in h(t). 

The path L’ may be deformed into a path D consisting of (7) ‘ihe 
of steepest descent which pass through some or all of the saddle 
points plus possibly (27) loops around branch cuts and poles. The 
path D is independent of x. When z is extremely large, all but a 
negligible part of J arises from contributions of very small portions 
of D. If ¢ = 0 is a singularity, one portion may lie close to ¢ = 0. 
Another portion is centered on the highest (that is, largest exp[xh (¢) |) 
saddle point. If the two highest saddle points are of the same height, 
a portion is centered on each, and so on. Thus when w is extremely 
large, the asymptotic series for J may be obtained by the classical 
r “usual” saddle point method. 

However, we may wish to compute J for values of « which, though 
large, are not large enough to allow J to be evaluated by the classical 
saddle point method. For such 2’s the highest saddle points and the 
singularity (for \ ¥ 1) at ¢ = Ocannot be treated separately, that is, their 
interaction must be taken into account. If other saddle points of lesser 
height lie in the critical region, they must also be considered. This is the 
range of x of interest here. Our problem is to obtain the appropriate 
expansion of J in descending powers of x. The type of expansion we seek 
is shown in equation (46) for J. 
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This type and the type shown in equation (15) have occurred in 
earlier publications*-* and have been called “uniform” asymptotic 
expansions because they hold uniformly as a saddle point approaches 
a singularity or another saddle point. 


III. CHANGE OF VARIABLE 


In Appendix A it is shown that, in the critical region, h(t) behaves 
much like a polynomial of degree » + 1 in ¢. Here p is the number of 
saddle points in the critical region. This suggests changing the variable 
of integration from ¢ to v where 


Fv) = h® (3) 


and F(v) is a polynomial of degree » + 1 in v. When F(v) is known, 
solving (8) for v as a function of ¢ gives » + 1 branches. The branch 
chosen for the change of variable is the one for which dé/dv ~ c through- 
out the critical region, c being a constant. That one and only one of the 
uz + 1 branches has this property is rendered plausible by the discussion 
in Appendix A. 

Fortunately we do not have to solve equation (3) to obtain the 
asymptotic series we desire. However, for some steps we do need the 
values of dt/dv and higher derivatives at the saddle points. These 
may be obtained by repeated differentiation of (3). 

F(v) is not uniquely determined by h(t). The factors which influ- 
ence its choice are reviewed in Appendix B. 

The change of variable from ¢ to v carries the integral (1) for J into 


pe / oY) exp [xF(@)] dv (4) 
where 


f(r) = g C/U, t= dt/dv. (5) 


The path of integration L starts in the v-plane valley corresponding 
to valley S in the ¢-plane, passes through the critical region surround- 
ing v = 0, then descends into the v-plane valley corresponding to 
valley EH. 


IV. THE BLEISTEIN METHOD FOR A = 1 


For the case A = 1, the integral J becomes 
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hy 
ll 


[90 exp tenn] at = [ 40) exp [2FO)] ey, a 


fo) = oS = g(t. 


The Bleistein method begins by constructing a polynomial Po(v) of 
degree » — 1 such that Po(v,) = f(v-), r= 1, 2,..., » where v1, V2, 

, V, are the zeros, assumed simple, of F’(v). By Lagrange’s inter- 
polation formula, 


_ S_1Oor') 
PO = EP") " 


where the primes denote derivatives. The polynomial may be written as 


Py) soe | See ee | FO— FO | dg 


s cl = o-— OFM — ~~ Qai (¢ = v)F'(s) (8) 
au (OQ, ») 
>. al ao FO F'(¢) 
where Q(Z, v) is a polynomial in v of degree p — 1, 
ag.) = FOO ) 


C=7* 
and f(v) has been added to remove the contribution of the pole at 
£ = v. The path C is taken in the counter-clockwise sense and encloses 
¢ = v and the zeros of F’(¢) but no singularities of f(Z). 

The expression for f(v) obtained from (8) gives 


l= / dv fv) exp [xFQ)] (10) 
? 10 


= | dv Pv) exp [xF@)] + [ dv exp [xF(v)] = = ore 


In order to simplify interchanging the order of integration in the 
double integral, we cut off the tails of LZ in the usual fashion. The 
error introduced by truncation is exponentially small compared with 
the terms that remain. Deforming C so that it encloses the truncated 
L (in the sense that it encloses the point £ = v for all v’s on the trun- 
cated L), interchanging the order of integration, integrating by parts 
with respect to v, neglecting the contributions from the integrated 
portions at the ends of ZL, and reverting to the original order of inte- 
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eration carries (10) into 


/ dv fv) exp [zF(@)] = dv P,(v) exp [zFQ)] 


1 dg {(O(-)) 
regard 7 a OT 11 
mil FOE | OY 
Incidentally, if the contributions from the ends (say at a and b) 
of the truncated L were not neglected, the right side of (11) would 
contain the additional term 


| He — P,v)] exp fer) | 
xl’ (v) oy 


The procedure used to establish (11) can be used to show that, 
for any function f,(¢) analytic inside C, we have 


+ : [ dv exp [xF(v)] 


| a 1.0) exp f2F) 


= [ dv P,(v) exp [zF()] + : [ dv fasiv) exp [xF')] (12) 
where 


_ 1 f at ho(-v 
bi) = i Je POE 





(18) 
1 QE, 2) 
P.O) = 5 far sco] EP]. 
Setting fo(Z) = f(f) and using (12) repeatedly gives 
i 3 er. i dv P,,(v) exp [xF(v)] + Ry , 
n=0 ZL (14) 


Ry = 2 *" [ dv fyi) exp [aF(Q)] . 


Since Q(Z, v) is a polynomial of degree » — 1 in v, the same is true 
of P,(v) and we write 


uo-1 
P02) = Dipw, n=0,1,2,>: 
t=0 


unl N 


l= 2 U(x) Spar + Ry 


n=0 


(15) 
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where 


Ur() = | o exp [2F@)] ao, PEG Tt as0. yee an C16) 


The series (15) is the type of expansion we seck. It would be desir- 
able to have close inequalities for Ry, but none are available at the 
present time. 

Another expression for P,(v) may be obtained from (13): 


lf dQG,» 1 f de fa@(—) 
Qride Ff) Qari Jo, FOE -— 9’ 


1 1 0 Q&,») 
Int i dz f| py dz F(z) | (17) 


Q(t, ») 
Omi ae df iol 74 ) 2] PG) 


In the first line C, must enclose the point ¢ = ¢ in the z-plane in addi- 
tion to the zeros of F’(z). Hence initially C, encloses C. When the 
order of integration is interchanged, the only singularity of the inte- 
grand in the ¢-plane lying outside C is the double pole at ¢ = z. Ex- 
pand C until it consists of a circle of infinite radius at oo plus a nega- 
tive loop around £ = z. The contribution of the infinite circle vanishes 
because the integrand is a rational function of ¢ of 0(f*) at o. The 
contribution of the pole at £ = z gives the derivative. 

Notice that the coefficients p,; in (15) are independent of the path 
Lin the v-plane. 

The procedure used to obtain the integral (17) for P,(v) may also 
be used to show that 


fan) = 55 fat HO vl wp alert (18) 


When » = 1 and F(v) = v0, the polynomial P,(v) reduces to 
(—1)"f°" (0)/4'n!) and f,+1(v) is equal to [f.(%) — off) — fa(0)]/ (20°). 


P,Q) = 








| 





V. COMPUTATION OF P,(v), } = 1: BLEISTEIN METHOD 


We shall regard the functions U;(x) in the series (15) for I as tabu- 
lated or easily computed. For example, when » = 2 the functions 
Uo(x) and U,(x%) may be expressed in terms of Airy functions. Then 
the most difficult step in applying the series is the calculation of the 
coefficients pyr, 1 = 0, 1,...,» — 1, of the polynomial P,(v). We 
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desire an expression for p,; in terms of the values of the functions 
g(t), h(t) and their derivatives at the saddle points ¢ = U,, r = 1, 2, 
oy Ye 
Let ¢ = t(v) denote the change of variable from ¢ to v, and let the 
saddle point v = v; in the v-plane [F’(v,) = 0] correspond to ¢, in the 
t-plane: ¢, = t(v,). We shall use the notation 


ho = Iekom , git = (ool 
1) = (ym | , re1,2,-°* yp. 


When convenient, we shall write h, for h“& = h(é,) and g, for g = g(E,). 
First consider the expression (7) for Po(v). As shown in Appendix 
B, F(v) isa polynomial of degree » + 1, 


(19) 


url 


FQ) = pa Aw (20) 


whose coefficients A; may be expressed as functions of the h,’s. When 
this equation for F(v) is used in (7), the coefficient of v? in the result- 
ing expression for Po(v) gives 


uti 


pa = Dy GA; eae 1=0,1,-:+,H—1. (21) 


r=l 
Multiplying the right side of (21) by —1 and changing the limits 
of summarion for 7 from 1 + 2, » + 1 to 1,1 + 1 gives another ex- 
pression for Po. 
Since f(v) = g(é)t™, we also need an expression for ¢“” in terms of g(t) 
and h(t). Differentiating F(v) = h(t) twice with respect to v and using 
h&? = 0 leads to 


FO.) = gt, P= [FV @)/h PY. (22) 


The sign of the square root is chosen to agree with the constant cin ¢ & cv, 
the form assumed by the change of variable throughout the critical 
region. 

Since the A,’s and v,’s may be expressed in terms of the h,’s, equa- 
tions (21) and (22) show that po; depends only on the h,’s, g,’s and 
h,()’s, 

When 7 is general, an expression for p,; similar to (21) for po; may 
be obtained by expanding the derivative in the integral (17) for P,(v) 
in partial factions and then using the Cauchy integral theorem. For 
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n = 1 it is found that 
wtl Bob ; 1 
Pu= Dy iA: DY ERT) 
yin? | fd = 1) et EN ow 02) (23) 


(v, — 0,)° v, — 0, 
where the prime on }*’ denotes that the term for s = r is omitted. The 
primes on f(v) and F(v) denote derivatives with respect to v. 
The expression obtained for P,,(v) is of the form 


uw 2nt+1 


P.e) = De afta @,)/(m — Y)! (24) 


where a,” is a polynomial in v. Recurrence relations for the a’s may be 


obtained with the help of the partial fraction expansion 


(Ca) Sy ee | ae (v, — 0,)* =| 
FO > —F%,) Cad: by (e= » C= . (25) 
The relation 5°’ [1/F’(v,)] = — 1/F’’(v,) can be used to simplify the 


coefficient of (£ — v,)7"". 
The mth derivative of f(v) evaluated at v,, 


#0) = S( ™) or] (2) “| (26) 


contains derivatives of t(v). They may be obtained by extending the 
method used to get ¢‘”. Straightforward differentiation of F(v) = h(t) 
with respect to v leads to 





2 
{> = 


1 
~ 3t; [Re oe Ree | 


(277) 
1 2 
{= Ke [FO — GH p4O2 — Axe — gp pO], 
where the nth derivative F™ of F(v) is evaluated at v, and h™, 
t™ are evaluated at t,. 


The values of ¢%+” for larger j’s may be obtained with the help 
of equation (94), namely 


Po) = Sh? Wes (28) 


where Cy, = ¢™, cy, = ¢@" and the remaining c’s are given by the 
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recurrence relation (96). Setting k = 1 in (96) gives 


n-2 
Cn.2 3 mee EP? 4+ » (" a eae (29) 
m=2 


forn = 3, the last summation being omitted when n =3. The term in (28) 
for k = 1 vanishes when v = v,, ¢ = t,. Substituting for c,,, its value 
given by (29) and solving for t~” leads to the desired result when n = 3: 


n- 1 n = n— 1 n—m) 4(m 
a 1) = 5, |r 3 he, ee 2d ( bs yu 46 | : (30) 
The value of F™ (v) is zero forn > p+ I. 


VI. COMPUTATION OF P,(v), X = 1: URSELL METHOD 


The Ursell method avoids the evaluation of the derivatives of f(v) 
which appear in equation (24) for P,(v). Instead, it makes use of 
classical saddle point expansions about the individual saddle points 
in the ¢ and v planes. 

Let u different paths of integration, L/, Li, ... , ZL! be chosen in (6) 
such that the chief contributions (as x > ~) along the paths correspond- 
ing tor, namely L/ in the ¢-plane and its mate Z, in the v-plane, occur at 
the saddle points t = ¢, and v = », , respectively. Let the classical 
asymptotic expansions around t, and v, be 


= [ g(t) exp [xh(t)] dt ~ exp [xh(t,)] yD Ont”? (31) 


[Ui(a)], = di v’ exp [xF(v)] dv ~ exp [aF(v,)] 3 Brimt”*. (82) 


Using h(t.) = F(v,), substituting (81) and (32) in the uniform asymp- 


totic expansion (15) with N = ©, and equating coefficients of «~”"? gives 


ual 


arn = » pS Br imDn- m,t 


a (33) 


= »2, Br10Pat + > »D BrimDn— m,l 


where the second sum in the last line is omitted when n = 0. The 
expression (17) for P,(v) shows that P,,(v) remains the same, irrespective 
of the path of integration L, as long as f(v) and F'(v), that is, g(é) and h(é), 
remain the same. Hence, forn = O andr = 1, 2, ..., », (83) furnishes p 
simultaneous linear equations which may be solved for pu, 1 = 


~ 


7 
A 


~ 
a 
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0,1,---,— 1. Similarly when n = 1, (83) determines the P18, and so 
on. It turns out (see equation 39) that B,19 = v8.0. This allows us to 
write the simultaneous equations in the form 


P,(v,) = a0/ Broo ) as 1, 2, erate (84) 
n —-1 B 
PO). = (Bam step, ml 
a eg m=1 1=0 Broo 


Expressions for an and B,m may be obtained from the classical 
saddle point asymptotic expansion (103) given in Appendix D. Chang- 
ing n to 7 in order to agree with the notation of Appendix D gives 


7+} 27 (2i-n) 
= (n)’ iE al do Cees n)! = Dial a) 39 (35) 


where hi”, g® are the derivatives defined in equations (19), and 
QOes ie =2(r+1)...(¢+n-—1). Theb,,,’s are computed from 
the recurrence relation (100), namely 

n—m+1 


Deciace = n = n+l du ka,b m,n—-k+1 4 
starting with bo9 = 1 and using 


Oper?) 
(kh + Qh? 


The value of arg [—2/h } is equal to arg (é — t,) on the portion of L! 
(deformed into a path of steepest descent through t,) leaving ¢,. 
Similarly, 


ith 27 ae | 2Qji-n oar = peers n 
Bri; = (x)? | | dX ( ) 3; Dae »» Dn( 3) m+ (37) 


where now the 0,,,’s are computed from (100) with 


_ —2rikt?) ec (2 y | 
= (k ae 2)! pie ) Fr, —_ dv Fv) ‘ = (38) 
Setting 7 = 0 in (85) and (87) gives 


| —2 |? i) —2- 
ar = @'| 3] Or Brio = cx) | ek [ot ) 


Se 
Polv,) = ay Ee a” = = {PG fo f,) 


where ¢,’? and f(v,) are the same as in equation (22). The relation 


Oo, = b= 1,2,-+:. (36) 





ay 


i 
e 





(39) 
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P,(v,) = f(v,) is the starting point for the Lagrange interpolation formula 
(7) in the Bleistein method. 
Setting n = 1 in (34) and 7 = 1 in (85) and (387) leads to 


~ Bria 
Eas an a -z B00 Pot 


A —2t [Jig (1) (3) jp 5j (2? 
as = | h® ie —_— ork = g a aA 6h + 48h? : (40) 


A ll ~ 1 yee Pa (Lig MY hia 
a = Et 4 2 - 4p ss |Z oe AQP? 


where the subscript r on the braces indicates that the enclosed g’s, 
h’s, v’s, F’s have the subscript r. 

















VII. THE BLEISTEIN METHOD FOR \ # 1 


Here we deal with 


(oe i. 8 1g(1) exp [eh(i)] dt = : Y“4@) exp [xF@)] do 


fo) = g((tryt™, = t = dt/dv. 


The origin is now a singularity, and its vicinity may contribute to J just 
as the vicinities of the saddle points do. Accordingly, we now require that 
the polynomial P,(v) be such that P,(0) = f(0) in addition to P,(v,) = 
f(v,), r = 1,2, ... , wu. Assume for the moment that F’(0) # 0,that is, 
that the origin is not a saddle point. Starting with Lagrange’s interpola- 
tion formula and proceeding as in Section IV gives 


(41) 


_ {OF’®) , 4 _fe,)F'O) 
Po) = “ar7@) + 2G 0nF" 
SOF") de 

= ov — OLF(D 


1 f (QC, ») dé 
2nt Jo fF’) 


= jo) +55 


(42) 
2 SE) =F) 
where C encloses € = v,£=0,£=0,,r = 1, 2,..., » but no singu- 


larities of f(£). Here Po(v) and Q(Z, v) are polynomials of degree 
p instead of » — 1. 
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When the origin is a saddle point, P,(v) is still given by the expressions 
in (42) which contain integrals. In fact, we have P{(0) = f’(0) (with 


primes denoting derivatives) in addition to P,(0) = f(0). 
Much as in Section IV, we obtain 


[ aw £0 exp (FO) 


= / dv v** P,(v) exp [xF()] + a dvv frei”) exp [2F(Q)] 
where fo(v) = f(v) and 


nN Ont cho (¢—v)? 7 


ak. 2 IMOVE, V)_ 
P.O) = oi cF’(s) 


Equations (48) and ne lead to the desired series for J: 


J 


ll 


> “” i dv v’* P,(v) exp [zF@)] + Ry , 


n=0 


hye Se i dvv fraiv) exp [F(v)] dv. 


When P,,(v) is written out we get 
P,v) = yo pa n=0, ie 
t=0 


Mu N 
J = dX Vix) s Dut" + Ry , 
=0 n=0 


Vila) = | v** exp [xF(v)] dv, L=0,1,2,--+ ,p. 
L 


Furthermore, the recurrence relation (44) for f,(v) leads to 


Pp) = f Bhd) | 19 AC, »| 
2rite SFC) LS ag FD 


A-1 1 a] "Q(¢, vo 
=a, dg (OE Ee 2 F(6) 


fasiY) = ora dt f(O(-—De™ iE Al (At — dw Hove 














F’(§) ag 


(¢ — oF) 


(43) 


(44) 


(45) 


(46) 


(47) 
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When X = 1 the formulas of this section do not reduce to those of 
Section IV since they contain the additional condition P,,(0) = f(0). 
However, (46) gives the same series for J as (14) does for I because 
V,(z) can now be expressed as a linear combination of Vo(x),..., 
Vu—1(x) [which become Uo(x),..., Uz_1(2) ]. 

The only singularities enclosed by C in the integral (47) for P,(v) 
are poles at £ = O and até = v,,r = 1, 2,..., p. Evaluating the 
integral by Cauchy’s theorem gives the coefficients in P,(v) as the 
sum of derivatives of f(v) at v = 0 and at v = v,. The derivatives 
at the saddle points may be obtained by differentiating 


In fo) = Ing) + = 1) In + mt” (48) 


with respect to v and using the expressions for ¢‘” developed in Section V. 
The derivatives f (0) may be computed with the help of the series 


2 
hi as Sh Be gO > a (49) 


2! 3! 
where ¢;” denotes the nth derivative of t with respect to v at v = 0. The 
ti’s may be obtained by differentiating F(v) = h(é) repeatedly with 
respect to v and then setting v = 0. If F’(0) ~ 0, 


(1) 
(1) Py 


0 — 7Zay 
0 


(50) 


(2) (2) 4(1)2 
(2) _ Fo nee ho to 

b&b = qd) 
0 


where the subscript 0 refers to £ = 0 when it is on fh and to v = 0 
when it is on F. Higher order derivatives may be computed by using 
the results of Appendix C in much the same way as in Section V. 

It may be verified that 


{(0) = Go. ae 


X 1 (0) 4 (2) 
{(0) = io] air” suit + ae 


(51) 





where g{” is the nth derivative of g(t) with respect to ¢ evaluated att = 0. 


VIII. THE URSELL METHOD FOR \ + 1 


When the origin is not a saddle point, the » + 1 linear equations 
to be solved for the coefficients p,,,1 = 0,1,..., in P,(v) turn out 
to be 
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I 
_ 
bo 


P,Q) ie! as aa ~ > (2:2 em ’ r ‘bh (52) 
r00 m=1 l=0 r00 


aw SP (Bx=)p Pieces (53) 


Pno = 
Booo a=1 l+m=q 

where the summations are omitted when n = 0. The summation condi- 
tion 1 + m = q in (58) is also subject to0O S15 4,0 S mS o~. 

Equations (52) are given by the analysis of Section VI for the 
case X = 1 when g(t) is replaced by #~7g(t), v' by v'+4—1, I, by J,, 
and U;(x) by V;(x). The o’s and #’s in the rth equation of (52) are 
the coefficients in the classical saddle point expansions about ¢, and v;: 


te = | t’*g(d) exp [xh(t)] dt ~ exp [xh(t,)] 3 a," 
- n=0 (54) 


LVi@)). = i v'**" exp [2F()] dv ~ exp [xF@,)] dX Brant”? 


The a,; in (52) (with j for n) is given by equation (35) for a,; with 
‘2-") yeplaced by g{?'~” where 


9, 
GD = EG) = > ae g.” 


0% —. (1\ o- ge 
jo SD ("i gi ag 0 ae ae 
k=0 


The 8,4 in (52) is given by equation (37) for 8, with [ replaced by 
L+ A — 1 on the right side. 

Equation (53) arises from a consideration of the region around the 
singularity at the origin. As described in connection with equation (106) 
in Appendix D, let L/ be a loop enclosing the branch cut running out 
from t = 0, and let LZ, be its mate in the v-plane. Then, as x — ©, the 
a’s and #’s in (53) are defined by 


(55) 


, ; O*9(t) exp [xh(t)] dt ~ >> aonv™”™ 
LO’ n=0 (56) 


[Vitale = fo! exp [2PM do ~ YBa". 


Substituting (56) in the uniform asymptotic expansion for Jo given 
by (46) and equating coefficients of x~"— gives (53). 
Using the asymptotic series (106) to determine ay, and Bm leads to 


as ms gey= =a) re ft a —? » ia 00 Bere (57) 
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where the subscript 0 refers to the origin and by,» is computed from 
(100) with a; given by 


a, = —ho? /{(k + 1) ho”). (58) 


The value of ¢{? is the value of dt/dv at v = 0 determined by the change 
of variable from ¢ to v. Similarly, 


i 


Bos _. eS (d) (59 
Booo 7 BS m=0 m pe ) 


where, replacing 7 by 7”, bin is computed from (100) with 
a = —FS'P/kKE DIFFS], bz 1. (60) 


In Appendix F the theory which has just been developed is applied to 
the case of one saddle point (u = 1, \ # 1). 

So far in this section it has been assumed that the origin is not a saddle 
point. Now let the uth saddle point coincide with the origin so that F{” 





and h{ vanish. The » + 1 equations determining p,;,1 = 0,1, ..., 4 
are NOW 
Arn . = Brive 
PO) SSS Do me Pe pe AG) 
B;o0 m=1 1=0 Broo 
Pno = res = Ds » (Eo pe. (62) 
000 @=1 m+t=2q \PBooo 
1 n 
Pua = Boo aa — Boor1Pno — > >» ParaPe-s. | (63) 
010 a=1 m+l=2qt+1 


where the summations are omitted when n = 0. The values of 1 and m 
occurring in the inner summations in (62) and (63) must also satisfy 
OslsSpand0Sm&S o~. 

Equations (61) are the same as (52) except that r runs from 1 to 
» — 1 instead of from 1 to ». The a’s and f’s in (62) and (63) are the 
coefficients in the asymptotic expansions 


J, = , t\*9(2) exp [xh(t)] dt ~ > oye A 
L'0 = 64) 
[Vi@)]o = is v? exp [2F()] dv ~ >> Borne tt? 

LO — 
where, as discussed in connection with equation (109), the paths L/, Lp 


coincide with the paths of steepest descent through ¢ = 0, v = 0 except 
for indentations at those points. 
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Equation (62) is obtained by substituting (64) in the uniform asymp- 
totic series for Jy given by (46) and equating coefficients of 2 °"*””’. 
Equating coefficients of x “"*'*’’? gives equation (63). 

Some results for the case of two saddle points, one of which is at 
the origin, are stated in Appendix H. 

IX. SIMPLE POLE AT THE ORIGIN 


When there is a simple pole at ¢ = 0 and one saddle point in the 
critical region, a case discussed briefly by Bleistein,? we have 


oes | t? exp [xh(2)] dt. (65) 
> 


In the critical region, L’ is assumed to coincide with the linear path 
running froma —10 tow+t0,a> 0. 

Let h(t) be real when ¢ is real and in the critical region. Let the saddle 
point ¢, lie on the real axis. As usual, h, = 0, hi” = 0; and we assume 
h, <0, hi” > 0. As suggested by example (2) of Appendix 3, we choose 


F(v) = v? — vv, v= —hy 
where v, is real. We write 
t 1 1 
Ve Ta | oe (—h,)? 2 0 
1 
in order to make v,; and ¢ have the same sign. 


Equation (46) shows that the uniform asymptotic expansion for 
J has the form 


J ~ V(x) > Prot ” + V,(x) a Dut ” (66) 
n=0 n=0 


where, with LZ parallel to, and to the right of, the imaginary v-axis, 


V(x) = / v* exp [xF(@)] dv = ix[1 — erf @,2')] 
Vi(z) = / exp [xf (@)] dv = i(r/x)' exp (—20%) 


erf (z) = = i exp (—0°) dt. 
T vo 
Putting \ = 0 in the integral (47) for P,,(v) gives 


P,() = Pno ae Priv 
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os ae 1) ( 1 arte =) 
2QriJce 2° \E§ — yy, OF 5 yy (67) 


fe) = vt /t, P= di/dy 


where C encloses £ = 0 and £ = v, but no singularities of f(Z). Setting 
v = 0 in (67) gives 


P,(0) = poo = f(0) = 1, 
P.O) = po =0, n>O0. 


Here the series (49) for t/v has been used to show that f(0) = 1. 
Therefore the series for J reduces to 


J ~ in{l — erf [v,2°}} + i(r/z)' exp (—2v?) DS pat. (68) 
Setting v = v, in (67) gives 


He) ee 


ty V1 
1 f dI@ ee. g 


Pu = 2x1 Cc 20-6 (- — VU, 2) (; 7 -) ss oe 0. 
From (22) and F’’(v) = 2 it follows that t! = [2/h®]*. The integral 
for Dai may be evaluated in terms of the 2nth derivative of f(v)/v = 
t/t = (d/dv) In t(v). Thus, writing ¢/(§ — »,) as 1 + 0,(€ — 0,)7 
and using 


ll 


Poi 
(69) 





( aN —,)7 = (-1)1-3 ... Qn — YE — 1)", n > 0 


6 — v, 0¢ 
_ (yd, fu (2) | 7 \ 
Das get (2n)! L\dv a eo | 


leads to 

The first of the pny’s required in the series (68) for J is given by 
equation (69) for yo1. The remaining ones may be obtained by using 
the Ursell method equation (52). Since poo = 1 and pao = O for n > 0, 
equation (52) gives forn > 0 


| a a piay = »; BrinPen | . (70) 


m=) 





Ee V;B100 
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From equation (55), g(t) = ¢t”’ and its nth derivative at t, is gf = 
(—1)"nltz""*. Replacing g by @ in (35) leads to 


2 fe! ny-2jtn— = 
an; = a 2 | be (=) ty” : Ds mal) m+i 
1 n=0 m=0 
where arg [—2/h\?]? = 2/2 and the b,,,’8 are computed from (100) with 


a, = —2hi**? /[(k + 2)! hi], pS 12. tek (71) 


Equation (54) shows that the Bimm’s are the coefficients in the asymp- 
totic expansion of [V:(x)]1, that is, in the asymptotic expansion of an 
integral, which has the same integrand as V;,(x), taken along the 
path of steepest descent through the saddle point v = v;. Instead of 
obtaining the #’s by the general procedure outlined in Section VIII, 
we notice that the “asymptotic” series for V,(x) consists of only one 
term. Consequently Biim is 0 when m > O and the summation in 
equation (70) for pn»; disappears. Moreover, the asymptotic series for 
the error function gives, when v; > 0, 


Vole) ~ ile)* exp (—avt) D> (= "Gaal" 


When v, > 0 and x > o, Vo(x) is given asymptotically by the con- 
tribution from v1. When v, < 0, the asymptotic expression for Vo (x) 
contains the constant term 271, but the contribution from a path of 
steepest descent through v, is still given by the same expression as 
for positive v;. Hence, irrespective of the sign of vj, 


Brom = i(n)*(—1)"(3) nvr. 


These results enable us to write equation (70) (with j for n) as 


—1)7(4). ae 2741 27 7 n ; 
Pi = ( 1)" : [| Dy (—t) > big + Sa —. i} (72) 


=0 








for j => 0. Here, to repeat, 


v1 =a | i | ( hy) ? by ~~ hy? ) hi == | ty | tag p] 


(o=1, ©. =ee+)---C+n-} 


and the bmn,’s are computed in succession from equation (100) with 
a, given by (71). 
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The first two pny’s are 


1 (1) 
Pa = , E —: 1| 
1 1 (74) 


1 iE” ; 3 2/8 15,2 
Pir = ~ 293 t, {1 = 3t,a, ++ ti ($a, + *5-a;) | = 1 





and the b,,,,’s for po; may be read from the table in Appendix D. 

When | £; | is small, the expressions which have been given for pm are 
essentially small differences between large numbers. If the calculation is 
being performed on a digital computer it may be advisable to use double 
precision. Expanding h(t) about ¢ = ¢, and then setting ¢ = 0 leads toa 
series for —h, which may be used to obtain v;?/~* as [¢/t,]?’** times a 
power series in ¢, . Series of this type can be used to show that 


Pa = S ee a O(t;) 
(75) 
Pi = —4t Ba; + Vara, + at) + O(t) 
where a; is given by (71). 


| 


X. THE NONCENTRAL x? DISTRIBUTION 


Let x be a positive integer and y,, yo, ... , yz be independent gaussian 
random variables with unit variances and respective mean values 
i) Ja; ... » Jz . Let z be the (noncentral x’) random variable 


pees (76) 


A aay 


It may be shown that the mean value of z is Z = 1 + r and that its 
variance is (2 + 4r)/x where 


-lS eg 
r= hth. 


Furthermore, from Ref. 2 (with a change of variable) the distribution 
function of z is 


1 E+i0 _ 
Prob (0 $2 5) = 5 | ___ © exp [ah(0] dt (77) 
where é > 0 and 
h(t) = 3st -nM1 +0 +r +07 — 7I. (78) 
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The integral on the right side of (77) is seen to be equivalent to 
minus Marcum’s Q-function (and also to an expression given by 
R. A. Fisher) when (77) is written as 


Prob [(0 Sz Ss]= 5 i (2/r)°/?* exp [—ale + r)/2)] 2) -rfx(r2)" dz. 


Here J denotes a Bessel function with imaginary argument. 

We are interested in computing the distribution of z when z is large. 
The equation h® (t) = 0 gives two saddle points. However, as pointed 
out in Ref. 2, when z is large only the one at 


1+ (1 + 4rs)} 
2s 
need be considered. The value of ¢, is real and > —1. When s = 1 
+ r,t, is zero; and when s increases through 1 + r, ¢, decreases 
through 0. | 
From equation (68) the desired asymptotic expansion is 





= Se 


Prob [0 Sz Ss] ~ 3{1 — ef [v.2*]} 


+ AGrx)# exp (—a0?) D pat — (79) 


where pn1 is given by (72). The quantities entering pn, are 
n= 7 (hb), k= Mh), a? = 2/hT 


AM) = K-1I)'a—-—DY! [nd +0"7'4+0+097), n2e2 
hy? =r + 4)°% +270 + 4)? 
2-1)" | 2) Lt | ; 
(kK + 2)(1 + &)° ®+it+th J 
The values of po; and p,; may be obtained from (74) by substitution 
of the parameter values (80). 


When s — Z = s — 1 — ris small, the sonal limit theorem in the 
theory of probability states that 


Prob [0 sesg~{itet[ S>2e]. 


This agrees with the approximation given by the error function term in 
(79) when it is noted that t; ~ — (s — 1 — r)/(1 + 2r) and — h, & 
thi?/2ifs — 1 — ris small. 


(80) 





Qa, = =i. 
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The ordinary x? distribution is obtained by setting 7 = 0 in the 
noncentral x? distribution. In this case we have 











b= Ft, 4? = 2s, 
l—~s ns) 
= 81 
Vi La 9 ? ( ) 
_ _ 2(-s)' es 
a = kL? kee he 


Equations (74) show that the first two coefficients po:, p11 in the 
asymptotic series (79) are now 








jie os 2 1 
0 = rr 
eS em as ay. 
pu = ~44(2) «+ 12 ys 
When s is close to its average value 1, equation (75) gives 
Pou = 2/3 + O(4) (83) 
Pi = 1/135 + O(t,). 
Setting v = 2c and r = 0 gives 
Prob [0 Sz Ss] = a / u* exp (—u) du 


_ > (cs)” exp (—cs) on oa cs) 
where c is assumed to be an integer (x even) in the last equation. 
These relations may be combined with the foregoing formulas to 
obtain asymptotic results for the incomplete gamma function and 
the Poisson distribution. 

There is reason to believe that the asymptotic expansion (79) for 
Prob [0 S$ z S s] may hold over the entire range 0 S$ s S o. For exam- 
ple, consider the ordinary (r = 0) x’ distribution. In this case the first 
two terms in (79) give 


Prob [(0 Sz Ss] ~#{1 — erf [v,24]} 


+ stra? | 2 —FJesp (mi) 65 


s 
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where x = 2c and 2, is given by (81). Let c be held fixed at some large 
value and consider further the three cases s > 0, s > ©, ands—1.In 
all three cases it may be verified that the leading terms in Prob 
{0 = z S s] given by (85) agree with those obtained from the exact 
equations (84) and the asymptotic properties of the incomplete gamma 
function. The expressions obtained from (84) are 


Prob [0 < z Ss] = (2ne)"*s’ [exp c] [1 + O@s) + OC")], s—0 
Prob (0 <$ z S$ s] = 1 — (2rc)7's°"'[exp (c — es) ]{1 + O(s') + OC"), 
| so (86) 
Prob (0 Sz S51) =4+34Qmc)'+0C’), s=1. 
APPENDIX A 
The Behavior of h(t) in the Critical Region 


In this appendix we show that, in the critical region, h(t) behaves 
much like a polynomial of degree » + 1, and we examine the change 
of variable from ¢ to v. 

First write 

atl 4ip (i) 
h(t) = DD a0 + Russ (87) 
2 ! 
where hi‘? is the value of (d/dt)*h(t) at t = 0, hi“*? is not 0, and R ys: 
is O(t"*”). 

One of the distinguishing features of a polynomial in ¢ of degree 
p+ 1is that when ¢ is much Jarger than r, where | t| = 7 is the smallest 
circle which encloses the zeros, the dominant term in the polynomial is 
the one containing ¢“**. The function h(¢) has a corresponding property. 
Suppose that the saddle points ¢,, f., ... , ¢, all lie within a distance e of 
the origin, and for the moment suppose that they may be moved towards 
the origin so that « may be made as‘small as we desire. Also suppose that 
hit? = A + O(e) where A # 0. We shall show that by making ¢ small 
enough we may find a range p < |t| < 7» throughout which 


hit) eau + D!. (88) 
Here 7 is such that when |¢| and « are less than 7, the remainder F,,, in 
(87) is negligible in comparison with the (u + 1) term 7,4, = t°7°AS*t? / 
(u + 1)!. Once 7 is fixed, p may be chosen to be arbitrarily small, subject 


only to p < 7». 
In order to show that (88) holds when p < |t| < 7, notice that by 
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repeated differentiation of the representation 
Vd = t¢—t)(t —t)...  — t,) E® 
it may be shown that G(O) = Af“*?/u! + O(e) and that Aj” is 
O[e**? *hi*?] for 7 = 1, 2, +++ , w. Hence when |#| > «, 
ee 





fa Otte | os fs) 
Choosing p to be arbitrarily small, subject only to0 < p < 7, and then 
choosing ¢ so that e/p < 1 establishes (88). 

This property of h(t) suggests that some insight into the change of 
variable from ¢ to v, specified by F(v) = h(t), may be obtained by 
considering h(t) to be a polynomial ¢(é) of degree » + 1. Then a natural 
choice of F(v) is F(v) = ¢(cv + b) where c and b are constants. For 
simplicity we take c = 1 and b = 0 so that v, = t,,r = 1,2, ...,4 
where 0, is the rth saddle point on the v-plane. The equation F'(v) = h(t) 
goes into g(v) = g(t) which we write as 


utd F a 
ev) — g(t) = > Art) 
= (v — )[A, + Axo + t) + As” +0t +P) 4+ +++] 
= 0; 
The branch used in the change of variable is 


vet (89) 


for which dv/dt = 1 everywhere. The remaining » branches, which 
are ignored in the change of variable, may be obtained by solving 


A,+A(Wut+O+... tAyaiQv® +o0*Ut+ ... +t") =0 (90) 


for v as a function of t. Writing (90) as 


GW, ) = ew oe = a, 


and expanding G(v, t) about v = t,, t = t, shows that, near ¢ = t,, one 
of the remaining branches behaves like v = t, — (¢ — t,). On this branch 
v = t, and dv/dt = —latt = t,. Again, letv = 6,,s =1,2,...,yH—1, 
be one of the » — 1 roots of G(v, t,) = 0 which is not equal to ¢,. Expand- 
ing G(v, t) about v = 6,,t = t, shows that near t = t, the correspond- 
ing branch behaves like 
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v=0,+ 


and that dv/dt = Oatt = t,. 

The examination of special cases suggests that there is a one-to-one 
correspondence between the » branches of (90) and the » saddle points 
in the sense that dv/dt for a particular branch is equal to —1 at its 
corresponding saddle point and is zero at the other saddle points. Thus 
it appears that the branch v = ¢ [or its analogue for general h(t) ] 
is the only one suitable for the change of variable throughout the 
entire critical region. 


APPENDIX B 


The Choice of F(v) 


The polynomial F(v) used in changing the variable of integration 
will be written as 


u+l . 
Fe) = >> Ap’. (91) 
7=0 
The positions ¢t,, t2,..., t, of the saddle points and the associated 


values h, = h(t,) are supposed known. We require expressions for 
the A,;’s which are either pure numbers or depend only on the h,’s. 
Although one or more of the zeros v;, v2,..., UV, of F’(v) = dF (v) /dv 
may appear in our final expression for F(v), they will always be ex- 
pressed in terms of the h,’s. 

Since F(v) = h(t), we have the 2 equations 


F,) = h, , 
F’v,) = 0, pom Ly By tre i fe 


(92) 


relating the 2u + 2 unknowns 2, 2, ... , Vy, Ao» As, .-. » Ayer Con- 
sequently we have at least two arbitrary choices (A ,.,; = 0 is forbidden). 
For the case \ ¥ 1 we shall always require the change of variable to be 
such that v is 0 when t = 0 and thus we take A, = 0. In some instances 
the form of h(t) aids in the choice of the.A,’s. For example, when h(é) is 
an even function of ¢, we can take F(v) to be an even function of v. 

In choosing F(v) it is helpful to notice that in the critical region the 
change of variable takes the form ¢t % cv, c being a constant. Conse- 
quently, from (87), 

p+ (cv)'h” 


Foe) = oo (93) 


i=l j! 
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and it follows that the yp zeros, v,, of F’(v) have nearly the same configura- 
tion in the v-plane (except for a possible rotation and magnification 
given by argc and | c | , respectively) as do the zeros, t,, of h’(¢). For the 
case \ = 1 there may also be a small displacement so that t % cv can be 
written more accurately as ¢ = cv + b, or dt/dv & c. Furthermore, from 
(88), we can take c to be one of the roots of 


Aye = Oh /(u + YE. 


The following examples illustrate possible choices of F'(v). 
(i) w= 1, #1, t, ¥ O. Initially there are 4 unknowns, %, Ao, A1, Ao 
related by 2 equations, and F'(v) given by 


F(v) = Agu? + Ay + Ao. 


We take A, = 0 (because \ + 1) and arbitrarily choose A, = 1 (for 
convenience). This carries the two equations into 


vi + Aw, =h, 
2v, + A, = 0. 
Consequently 
F(v) = v? — 2vv v= —h. 
This case has been considered by Bleistein.® 
(ww) » = 2,A = 1. Initially there are six unknowns 0, V2, Ao, A1, A2, As 
related by four equations, and F(v) given by 
F(v) = Ago? + Anu? + Ay + Ao. 
We take Az = 0 in order to simplify F’(v). The four equations become 
F’',) = 8A, + A, = 0 
3FQ,) = —Ay, + 3A, + 3A, = 3h, , r= 1,2. 


It follows that Vg = —Vj, Ao = (hy + hz) /2, Ayv1 = 3 (hy = hz) /4. 
For the remaining choice we take A; to be equal to —A;/3 and 
obtain, 


F(v) = $(t2 — hi) — 30) + 3(he + hy). 
Another choice for A; is 4 which gives 
Fw) = 4? — viv + 4h, + Ay) 
vi = the — hy); 2 = VY. 


This case has been considered by Chester, Friedman and Ursell.® 


SADDLE POINT INTEGRALS 1997 


(wt) wp = 2, 1,4, ¥ 0, t, = 0. Here the unknowns and F(v) are the 
same as in example (72), but because of h, = h(0) = 0 it turns out we 
have three arbitrary choices. Since \ ~ 1 we take A, = 0. Then both 
F(v.) = h, and F’(v.) = Oare satisfied by choosing v. = 0 and A; = 0. This 
leaves F(v,) = h, and F’(v,) = 0 to be satisfied by the remaining three 
unknowns v,, Az, A3. Our third choice is A; = 2. It leads to 

Fv) = 2v° — 80,’, v= —h,. 


(iv) w = 2, € 1, th, t, ¥ 0. This case illustrates the complication 
encountered for the general case when up = 2. The value of A, must be 0 
and we choose A; = 1. Then 


Fv) =v? + Aw’? + Ay 
F'(v) = 30° + 2A + A,. 
The last equation shows that Az = — 3(v1 + ve) /2, Ay = 3v1V2. Sub- 
stituting in F'(v,) = h, gives 
v3(—0, + 3v2)/2 = h, 
v3(—0, + 38v,)/2 = hp. 
Setting a = ho/h,; and p = vo2/v, leads to 
p — 3p° + 3pa —a = 0 
which has the three roots 
p= 1d = ol — alo, + 0+ aor) 


where w, = 1, a = 7°, w; = 7 *” and the star denotes “conjugate 


complex.”’ When #, and é, tend to zero, one of the p,’s tends to ¢,/t,;, and 
this is the value of p to be used. The value of v} is equal to 2h;/(8p — 1), 
and we have 


A, = —30,(1 + p)/2, A, = 3pvj. 


(v) » = 3. The general case of three saddle points may be handled 
by a procedure similar to that used in example (iv). We do not dis- 
cuss this case beyond mentioning that when we set v2 = pv1, V3 = oV1 
the variable uw = (p—1)/(o—1) must satisfy the equation 

u* — Qu° + 2au —-a4= 0, a= (hy = he) / (hy = hs). 


(vt) » = 3, ts = 0, h(t) even. Since h(t) is even we start with 


F(v) = vy aa Aww” — As 
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and find that 
F(v) = v* — Quo’, vi = —hy. 


This case has been treated by Felsen.’ 


APPENDIX C 


Derivatives of Composite Functions 


A result used in Section V to compute the nth derivative, t, of t(v) 
at v = v, is stated in this appendix. Let the argument wu in h(u) be a 
function ¢t(v) of v. Then 


(2) neo = Dy a | (94) 


where h™) stands for (d/du)*h(u) evaluated at wu = t(v), and the 
coefficients c,, ; are computed from the recurrence relations 


Cry io. Aaa (95) 
Cn+1,k+1 = eS ue tees ) i s k < n (96) 
m=k 


in which ¢” denotes (d/dv)"t(v) and be the binomial coefficient. 


Equation (96) may be proved by induction. Differentiating (94) gives 


Cn+1,k+1 = Bote. & + © ona 5) 1 = k =< n— 1. (97) 


We assume that (96) holds when n is replaced by n-1 and use it to 
express Cn, x41 a8 a Sum. Then one of the terms in the summand for 
d Cn, x41/dv contains d Cy», ;/dv. From (97), assuming k > 1, 


(1) 
dp °"" = Cm+1,k = t Cm,k-1 ° 


Equation (96), with (n—1, k—1) for (n, k), lets us sum the terms 
containing t™c,,;—1 with respect to m. Equation (96) follows upon 
combining binomial coefficients and using cy, = €%cp_4, n-1.- 

The recurrence relations may also be obtained by writing the right 
side of (94) as a Bell polynomial and using the recurrence relation 
for these polynomials.® Expressions for the cn, ;’s (up to nm = 8) as 
polynomials in the ¢™)’s may be obtained from Riordan’s table of Bell 
polynomials given on page 49 of Ref. 8. 
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APPENDIX D 


Formulas for Classical Saddle Point Asymptotic Expansions 


A result useful in obtaining asymptotic expansions of integrals is 


iE G(x) exp [eH (x)] dr 


oo 1 -1 (pt+i)/y 7 n ( e+ i) 
EUG" Bo Earle re) ow 


3=0 V m=0 
where « > 0, Re p > 0,v is a positive integer, and 


ao 


G(r) = DS T Gh ; H(r) = > v’H,, . (99) 


The bn»’s depend only on H(r) and are computed in succession, start- 
ing with boo = 1 and bo, = 0 for n > 1, from 


n-m+t 


1 
Dutt nity = n+1 >> Kay 0mn—K+1 . (100) 


k=1 
Here a, = — H,.;/H,,k = 1,2,.... 

Special values of b,,, are given in Table I. 

The asymptotic expansion (98) is based upon the gamma function 
integral 


| u’* exp (—u’) du = : r(2) (101) 
0 v yp 
and the expansion 
exp E dX a | = > ED Dn” (102) 
n= m=0 


The recurrence relation (100) may be obtained by differentiating 
(102) with respect to é, replacing the exponential by its series, and 
then equating coefficients of &y™t?. 


TABLE I—SPECIAL VALUES OF Din 




















n bon bin ben ban Dan ban 
0 1 
1 0 ay 
2 0 ae ay?/2 
3 0 a3 02 a,3/6 
4 0 a4 a\a3 + 21a.?2 2 1a;a2 a,*/24 
n>1 0 On — — — a,"/n! 
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For an integral, say J, in equation (31), having a simple saddle point 
att = ¢, and a path of integration L/ which runs up to, through, and then 
down from #, , we can use (98) with y = 2,p = 1,7 = t — #t,, G(r) = 
g(t, + 7), and H(r) = h(t, + 7) — Att,). The contribution of the saddle 
point is obtained by deleting the terms in (98) for which j is odd, 
doubling the terms for j even, and taking arg (—1/cH,.)' = arg 
[—2/xh‘? }* to be equal to arg (t — t,) on the part of L’ just leaving t,. 
The result is 


[90 exp [en(o)] at 


7+3 0 27 (2¢—n) 
~ exp [eh] py |: =3,| Lael » Bal m + j + 4) (103) 


where the derivatives of g(t), h(t) are defined in equation (19) and 
Bin IS computed with 


) ania 
“(eaytne” 


Qa, = 


(104) 


The gamma function I'(m + j + 4) may be written as Vr (4) maj. 

For the integral Jo given by equation (56) most of the contribution 
comes from the region near the branch point at ¢ = 0. When t = 0 is not 
a saddle point, there is only one path of steepest descent {for exp [xh(t)]} 
leaving t = 0. This path may be taken to be the branch cut in the t-plane 
and the path of integration L/ for J) may be taken to be a positive loop 
enclosing the cut. Then the asymptotic series for J) may be obtained 
from (98) by setting vy = 1,p = ,7 = t, G(r) = g(t), H(r) = h(t) and 
using in place of (101) the integral 


im u** exp (—u) du = [1 — exp (—7227z2)|T) = ee 


(105) 


Here arg u is 0 on the part of the path of integration leaving ¢ = 0. 
The positive real u-axis in (105) is a branch cut. The path of integra- 
tion starts at w = + oo on the top side of the cut, comes in along the 
cut, encircles u = 0 in the positive direction, then runs out to u = +0 
along the bottom side of the cut. 

It is found that 
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T= i "9(8) exp [xh(d)] dt 
Le’ 


- > E is i a 
aise 


7=0 n=0 GQ —n)! 


(106) 





» baal — exp (—2rid)]P(m ++ 4) 


where arg [—1/zh{"’] is equal to arg t on the part of L/ leaving t = 0, and 
ban is computed with 


here 


ee DPR 


The last relation in (105) may be used to handle the case in which A 
is 0 or a negative integer. 

When t = 0 is a simple saddle point as well as a branch point, the 
path L/ can be taken to coincide with the path of steepest descent 
through t = 0 except for an indentation at t = 0. The indentation is 
chosen so that a man walking in the positive direction along L/ would 
have the point t = 0 on his left. We put v = 2, p = XA, G(r) = gf), 
H(r) = h(é) in (98) and use in place of (101) the integral 


(107) 


a, = 


ix exp (—t1z/2) 


3 


(108) 
Here K runs from u = —o to uw = +00 with a downward indentation 
at u = 0, and arg u is 0 on the part of K leaving u = 0. Instead of 
(106) we now have 


hae exp (—u’) du = 3[1 — exp (— ina (2) = 


h-1 2 oe i, gfimm 
[Pr 9(0 exp eno) at~ © (FR) 


7=0 


Do Onng {1 — exp [—ir(A + pyr m ats A?) (109) 
m=0 

where arg [—2/zrh]? is equal to arg t on the part of L/ leaving t = 0 
and b,, is computed with 


k 
ae? 


E+ DT _ 


a, = 


2002 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 


APPENDIX E 


Two Saddle Points 


In order to illustrate some of the results of Sections IV, V, and VI, 
we consider the case of two saddle points, » = 2. This case has been 
discussed by Chester, Friedman, and Ursell.*;® From (15) the desired 
expansion is of the form 


[060 exp [en() at ~ Vala) Duo™ + Wile) DO past, 


(111) 
P,(v) = Pno + Paw 
where, from example (27) of Appendix B and equation (16), 
C= | Mem ERO a, TF =0ri 
L 
Fv) = 40° — vy + Ad, (112) 


Ay = 3(he + hy), v= ihe — hy), Ve = —V,. 
Arg v, is determined by the correspondence of v; with t; which comes 
with the change of variable from ft to v. 
Let L’ and the change of variable from ¢ to v be such that Z runs 
in from v = © exp(—i7/3) to the critical region near v = O and 


then out to oo exp(ix/3) (it may be necessary to reverse the direc- 
tion of L’). Then 


U,(x) = 2rix Ai(a'v?) exp (xAy) , 
U,(a) = —2mrix™ Ai’(x'v?) exp (x-Ao) 
where Az(z) is the Airy function and A2’(z) its derivative with re- 
spect to z. 
From F’(v) = v” — v? and equation (9) it follows that Q(f, v) is ¢ + v. 
Consequently equation (17) for P,(v) gives 


_ 1 fae me De le Se le 
Here f(v) = g(t in which t™ = dt/dv is obtained by differentiating 


F(v) = h(t) with respect to v. The path of integration C encloses ¢ = +1, 
but no singularities of f(¢). 


(113) 








From v. = — v,; and 


Poo Se U;Po. = P,(v,) ie f(v,), ore i 2 
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we get 








Poo = f@1) thoy) : Do = fw.) — f(—) (115) 


20, 
which may also be obtained from equation (21) for poy. 
Putting v = vy, n = 1 in (114) and expanding the integrand in 
partial fractions leads to 


Pi@) = ss [f(—v1) — f@.) + 207° ©) =e 203 f,)). (116) 


Expressions for pio and p,; which agree with equation (23) for pi; 
may be obtained from (116) by changing the sign of v, to get P; (—v,) 
and using n = 1 in 
_ P.O) + P=) 
Pno 7 9 
a PAOy) = P,(—2,) ; 
nl 20, 
If we were to continue with the Bleistein method we would have to 
evaluate f(™(v,) by using equations (26) through (30). Instead we 


turn to the problem of obtaining P,,(v,) by the Ursell method. For p = 
2, equations (34) become 


P,v,) = aro/ Broo ) C= iE, 2 





(117) 





(118) 
P,(v,) = Brn $7 BromPn-m.o te Daim Dastalt 
Boo m=1 B00 
where a,; and 8,1; are given by equations (35) and (387). 

Since g(t) and h(é) in the original integral (111) are quite general, we 
use equation (35) for a,; as it stands. However, equation (37) for 8,1; 
simplifies considerably. This is to be expected since it gives essentially 
the coefficients in the asymptotic expansions of A7(z) and A7’(z). From 
Fo = 2v,, FS = 2, and F” = 0 forn > 3 we have a, = —1/(20,), 
and a, = Ofork > 1. It turns out that b,,,1s 0 form ¥ nand b,, =at/n!. 
When J is set equal to 0-in (87), all terms vanish except the one for 
n = 27,m = 23. When / = 1, all terms vanish except those for n = 2), 
m = 2j7andn = 27 —1,m = 2] — 1. It is found that 


_ {7 : ae —1\*" _()s: 
B,oo = ( v, ? Bro; — Broa -1) 97(23)! 


= 1+ 6i) 
Bri; a »,(3 = 6) Byo; * 








(119) 
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Changing P,(v,) into P;(v,) in order to avoid confusion with the 
n used in Appendix D carries (118) into 


Pie) = OL 2) Ff So 
k Ae (2k — n)! f mn mt+k 








n=0 
1 + 6m 
=D oa 4 (2m)! a 1y “Daal pee eo ees (a+ om ai | (120) 
Here r = 1, 2; v. = —v; and b,,, is computed from (100) with a, given 


by equation (36). The value of ¢‘” is given by 


eG) Lel/G)} wn 


where arg t‘” is calculated either from (7) the form ¢ & cv assumed by 
the change of variable in the critical region or from (ii) arg [—2/h]} 
and arg (—1/v,)? being equal to arg (¢ — ¢,) and arg (v — »,), respec- 
tively, on the portions of the paths of steepest descent leaving t, and 
v, . The last summation in (120) is omitted when k = 0. The expression 
for P,(v,) may be written with the help of equations (40). 

Equation (120) was checked by using it to obtain the first few terms 
in the known’® uniform asymptotic expansion for the Bessel function 
H‘” (xz) with 0 < z < 1. Here h(t) = z sinh t — ¢, the saddle points 
are at +2, (t; > 0) on the real axis, and the path of integration runs 
fromt = —o tot = © + ir. If the direction of the path of integration 
is reversed [so that (111) gives —H!” (az)], the paths L’ and L can be 
brought into correspondence by a rotation of 120°. In the approximate 
form ¢ & cv of the change of variable, arg c = 27/3; and v, corresponding 
to t, is v, = | —8h,/2 |? exp (—7i27/3). Furthermore, f(v) turns out to 
be an even function, (114) gives P,(—v) = (—1)"P,(v), and pons, 
Don+t,o are zero forn = 0,1, 2, ---. 

When ¢, and t, approach each other, h{”’, hs”, v, and v2 tend to zero. 
In this case the asymptotic behavior of the integral (111) may be 
determined with the help of the equation obtained by setting » = 3 
in equation (98). However, if one is interested in the behavior of the 
coefficients p,; , the following relations are useful. Putting v, = 0 in the 
integral (114) for P,(v) shows that in the limit 


Poo = f(0), pu = f'0), (122) 


Pio = —f0)/3! , Pu = —2f (0) /4! 
and so on. The derivatives ¢™ appearing in the derivatives of f(v) = 
g(t)£® are now obtained by differentiating 3-v? = h(t) repeatedly 
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with respect to v. The leading coefficients are found to be 
Poo = ‘ead i s [2/h;>}" 
Dor pas hens aa + go ie iad = — Ay? 4rP>/12. 


(123) 


APPENDIX F 


Saddle Point Near Branch Point 


Here we apply the theory of Sections VII and VIII to a case discussed 
by Bleistein,* namely, \ + 1 and » = 1. The paths L’, L and the func- 
tions A(t), F(v) are assumed to be the same as in Section [IX where the 
singularity at the origin was a simple pole instead of a branch point. 
In the critical region L’ is parallel to, and to the right of, the imaginary 
t-axis. Only the case in which ¢, and »v, are real and of the same sign 
will be considered. When #, and v, are positive the cut associated with 
the branch point is assumed to start out from the origin along the posi- 
tive real axis, and then quickly bend downward to run out to —70. 
When ?, and v, are negative, the cut starts out along the negative axis 
and then bends downward to —i«. 

Equation (46) and F(v) = v? — 2vyzv lead to 


J ~ V(x) a Prot ” + V,(x) > Put ” (124) 


where, with c > 0 and the path of integration lying to the right of 
the cut, 





Vola) = fo texp [aF@)] dv = tra” er a 
“ (22) 


Replacing A by A + 1 gives Vi(x). Vo(x) and V;(x) are parabolic 
cylinder functions (Bleistein,? and pair No. 740.2 in Campbell and 
Foster Table®). 


P,,(v) = Pro ss Paw 


_ 1 ff at g@e? | 1a ii (fv — og 
2721 Jo 2 ¢— », 0¢ ¢-—2, 


where C' encloses £ = 0 and € = 2. Setting n = 0 and using f(v) = 
g(t) (t/v)*-*t™ leads to 


P,(0) = Poo = f(0) = Ga to 5 a —20,/h” 
Po) = f@), i = (o/h. 


(125) 








I 
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Setting n = 1 gives 
r 
P,(0) = 53 10) — f@) + vif? (0) 
1 


1—A ay) 
Por) = Sy 40) — fo) + of en] — EB. 
1 
The values of pro and pri may be obtained by aie the steps 
outlined in the first part of Section VIII. Equation (52) and (53) 
become 


T n 
P,.W) = B E — >» (8 16mDn—m.0 as Braden.) | 
100 m=1 (126) 


1 n 
Pno = E = a (BoocPn-¢.0 + Biase=iP—oa) 
Booo q=1 


The path LZ, in (54) is parallel to the imaginary axis and passes 
through the saddle point at v = v,. The path Loy in (56) runs up along 
the right side of the cut, encircles the origin in a positive direction, and 
then runs down to v = —io along the left side of the cut. 

To get Bi we replace J on the right side of (87) by 1 + A — 1 and 
notice that a; given by (38) is 0 for the values of k used in computing 
bn. Consequently, the only nonvanishing Dyn is bo = 1 and (87) 
gives 

Bir; (—4)? 


Broo = t(ar)*v.” is a ge ae 27 


The expression for a,, obtained by replacing Gi, 3a (35) by-oy 
where g(t) = ¢\*g(t), contains [—2/h\]? which may be written as it{”. 

In equation (57) for ao;/Booo we replace Van by Onn to indicate that 
Bn iS Computed with a, given by (58) instead of the a, used in computing 
@1n/Bioo - A still different a, , given by (60), is used to compute Bo2;/Booo - 
From equation (60) all of the a,’s used to compute Bo:;/Boo0. are zero 
except a, = 1/(2v,). Therefore b,,,, is zero unless m = n, and it follows 
from (59) that 


Bori/Booo = ()i+29(20,) 7/9. 
When all of these results are used in the expression (126) for pno we 
get, with 7 for n, 


= (2)? oe oft = =? » bus: 





2. (ae 
-> oe (CA BeeWoct Sapec Aba 
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where h(0) = hh = 0, hi < 0, h®@ > 0,0 = t lt |?(—a)3, ti = 
—2v,/h$”, and b,,, is computed from (100) with a, replaced by a, where 
d, = —he*? /[k + IPAS? , k21 


When 7 = 0 the summation with respect to s is omitted. Similarly, 
P,,(v1) gives 


a 41) (2741) _ Pio 
= (- 1)’ 0; ey ie aon pa n)! 4 s Glas v1 


_ >> (=1)"G = Now [= ag 8.0 7 Pino | pi 


s=1 s! (2v,)”* vy 


where t@ = [2/h]*, gi) = t'g(A), and b,, is computed from (100) 
with 


‘a n) 








a, = —2hy**? /[(k + 2)! Ay”). 
It is interesting to notice that when A = 0, (127) and (128) give 
values of pj and pj, which agree with those obtained in eee IX. 


When the saddle point approaches the branch point, t, , 2: , hj”, and 
v, tend to zero. In this case the integral J may be evaluated with the 
help of equation (109). The behavior of the coefficient p,, may be 
studied by putting v, = 0 in the integral (125) for P,(v). It is found that 


Poo = (0), pr = f 0), (129) 
Pi0 —rf (0)/2, Pir —Q + 1)‘ (0)/12 


and so on. The derivatives ¢™) appearing in the derivatives of f(v) = 
g(t) (t/v)-*t® are now obtained by differentiating v? = h(¢) re- 
peatedly with respect to v. 

Forn = land 2, 


P= 2/hP, P= —APUrye. (130) 


Substituting these values in equations (51) for f(0) and f (0) and 
using (129) gives the limiting expressions for poo and pou. 


I 
I 


APPENDIX G 


Poisson-Charlier Polynomial 

In this appendix equations (127) and (128) for pj and pj, are used 
to obtain an asymptotic series for the Poisson-Charlier polynomial 
Cn(y, a4) when y is O(1) and both » and a are large and positive. 
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Multiplying the generating function 


(1 — u)” exp (au) = 2) en(y, a) (131) 
m=0 . 


by w+ and integrating wu around a small circle enclosing u = 0 gives 
a contour integral for c,(y, a). Instead of c,(y, a), we find it more 
convenient to deal with the polynomial (in y) 





d,ly, a) = TRL) 


Setting x = n + 1 and making the change of variable = 1 — win 
the integral for c,(y, a) leads to 


Ci(y, a). (132) 





ai, t+ ico 4 t 
d,(y, a) = Sep ck pe at, z>ytl 
So ren a ai 
= Om digs ee 
me Lae 
Qrt ; 


(133) 
h® = -rt— nd — 4) 
where r = a/x. The ratio r is positive. 
We wish to use equations (127) and (128) to compute Dro, Pai in 
the expansion obtained by dividing (124) by 272, namely 


d,(y, a) = Js V(x) De Prot + V,(2) Dy Pmt, (134) 


Here V(x) = Vo(x)/2ri with \ = y + 1. The function V(x) is obtained 
from V(x) by increasing y by 1. 
We have 


c+ico 
V(x) = / vw exp [a@? — 2v,v)] dv, C0 
ay ee (135) 
~~ PET, v4.2] 
where arg v is 0 at v = c and Gy, 2) is the parabolic cylinder func- 
tion 


G(y, 2) = 53 = u’ exp (u” — 2zu) du 
(136) 
= 97} oy St 
nl ee ‘ — n) 
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The function G is related to the function U discussed and tabulated 
in chapter 19 of Ref. 10 by the equation 


Q-ul2 


Gy, 2) = Soy exp (—42’) U[—y — 3, 224]. 
2(z) 


The saddle point ¢ = t, is obtained by setting the derivative h‘” (t) = 
—r + (1 — t)~* to zero; and the saddle point v = » is given by the 





relation vj = —h, together with the condition that v, and t, be of the 
same sign: 
1 t r 
et ee vi, = —H(r-l— inn’. (137) 
r ea 
The values of the derivatives {@) = dt/dv at v = 0 and v = v, are 
2v 
(0. --,.25 @) 1 
to = 2v,/ho res 1 


wot 


i = [yay = =. 


For k > 1 the kth derivative h“(é) is (kb — 1)'(1 — #)~* and the 
coefficients used to compute Bmn.n , Onn from (100) are 


4, = —h& + YI —7)) = (e+ YG — 0D 
and 

a, = —2(k + Itr'*?/[(k + 2)!r?] = —2r'/(k + 2), 
respectively. 

Comparison of the integral (133) for d,(y, a) with the integral (1) 
for J shows that g(é) = 1 and \ = y + 1. Consequently g(é) = #~'g(2) 
becomes 9(é) = t’. For k > 0 the derivatives are g’ = 0 and g™ = 
yy = Le Gp eee 

Setting 7 = 0 in (127) and (128), and using the results just ob- 


tained gives 
= ()" = (2) Q" _ Be 
Epos (: Sf RR Ne ay v, ) 


for the leading coefficients in the asymptotic expansion for da(y, a). 
Here ¢; and v; are given by (137). The next two coefficients, obtained 
by setting 7 = 1, reduce to 





1 1 Ol 
es {-a4 \p 


1 
yt Dy + Pal 57 ay ae Op, 


_ ayl (2) HEE — 1) _ ytr tr | 
Pi (4 res T 4 9 + 94 |- 


(139) 
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Some idea of the behavior of the series (1384) for d,(y, a) may be 
gained from Table II. Equations (127) and (128) were programmed for 
calculation on a high speed digital computer. The table lists results for 
the typical case + = 30, a = 25, and y = —5. Here Term:, = 
V(x) pat”, Termons: = Vila)pae”, and S, = t +t + es + th. 
The “exact”? value, 381.02, was calculated by using the recurrence 
relation for the Poisson-Charlier polynomials. 

No study was made to decide whether the relatively large value of 
Term, results from accumulated round-off error (an accuracy of 1 
part in 10’ was used) or whether the asymptotic scries actually starts 
its divergence around m = 5 or 6. 

When r is near unity, v; and ¢, are small and the mdividual terms 
in the expressions (127) and (128) for pjo, pj1, become large. In this 
case considerable cancellation occurs, and a high degree of precision in 
the calculations is required to obtain accurate values of pjo and pj. 

An asymptotic series (nonuniform) which is useful when r — 1 is 
small may be obtained by a variation of the classical method which is 
sometimes used in cases of this sort. Instead of using an expansion 
about both t = 0 and ¢ = ¢,, which is done (in effect) in obtaining the 
uniform asymptotic expansion, an expansion is made only about t = 
0. Thus, the exponent zh (t) may be written as 


ah(t) = [—2(r — 1)t + xt*/2] + @P)O/3 + 0/4 + ---). 


Changing the variable of integration from t to u = t(x/2)” and as- 
suming that r — 1 is so small that 2 = (r—1) (#/2)% is O(1) gives 


exp [xh(t)] = exp [—2zu + u’] exp [u°(2t/3 + 22/4 + ---)] 














(140) 
= exp [—2eu + uw] >> (2/2)? dS Bnaw”*. 
n=0 m=0 
The last series is obtained from equation (102) with é = t = u (2/r)%, 
TasLe I]—PartiaL SuMS FOR d29(—5, 25) 
m Termm Sm m Termm Sm 
0 276.62 276.62 4 0.015 381.03 
1 82.82 359 .44 5 —0.008 381.02 
2 20.19 379.63 6 0.008 381.03 
3 1.39 381.01 7 —0.144 380.88 








Exact do —5, 25) = 381.02 
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y = u?, and a, = 2/(n+2). The coefficient b,,, is computed from the 
a,’s and (100). 
Substituting (140) in the integral (133) for d,(y, a) leads to 


Ea) >, Clay Pr So ba Koc (141) 
n=0 m=0 
where 
1 et+io eg 
K, == u”” exp (wo — 2zu du) 


201 c-to 
Gn + y,2 = Gin + y, & — 1(e/2)'). 


The recurrence relation 


I 


hoe, = net Ky (142) 


permits Kon4+n in (141) to be expressed as a linear function of Ko and 
K,. Equation (142) is obtained by integrating the derivative 


oye exp (u? — 22u) = [(n + y)w* + 2u — 2z]u"*” exp (uw? — 22u) 

As r > 1 the leading term, (2/x)‘’*”’’K,, in (141) tends to the 
leading term V4(x)poo in the uniform asymptotic series (134). Although 
(141) is much simpler than (134), it does not hold for nearly as wide a 
range of values of r — 1. 


APPENDIX H 


Saddle Pownt at Origin 


Here we are concerned with the leading term when \ ~ 1 and there 
are two saddle points, one at t = 0 and the other at t = t, . We assume 
that ¢, is real and positive, and that A(é) is real on the real axis. Further- 
more, we assume h, < 0, hi” < 0, hi” > 0 so that the saddle point 
at #, is lower than the one at 0, and the paths of steepest descent at 0 
and ¢, are parallel to the real and imaginary axes, respectively. A cut 
extends from 0 to — ~ along the negative real axis. 

The paths of integration L’ and L are taken to run in from o 
exp (—t7/8), cross the positive real axis in the critical region, and the 
run out to o exp(%/3). Example (2) of Appendix B leads us to 
choose 


FQ) = 20° — 30u", = of = —h, (143) 
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with arg v; = 0. From equation (46), the uniform asymptotic ex- 
pansion is of the form 


J ~ Vola) DY Dot” + Vila) Do paz” + Vile) Dd) pet” —(144) 
n=0 n=0 n=0 
where 
Vi(z) = i v'**? exp [xF(v)] dv, t= 0,1, 2. 
ZL 


Expanding exp(—3xv,v?) and integrating termwise with the help of 


Le exp (u’) du = 2ni/| sr? a ‘) | 


Qrt po y-ra So __(—80,)"(a/4)"* 
Viola) = = ay? De (145) 
3 oka | r(1 = ON 4 20) 


from which V;(x) may be obtained by replacing A by A + 1. When A = 
1, Vo(x) reduces to the product of an Airy function and an exponential. 
Setting n = 0 in the integral (47) for P,,(v) gives 





gives 


P,(v) = Poo + Po + Dood” 


ee [e+ £@ — w) + @ — ow) 
~ Qnt de A) Pe — a) dg 


= {0) + f’Ov + [f) — fO — o4f’O)]r7?. 
The values of ¢§”, ¢{°, t{? appearing in f(0), f’(0), f(v,) are 
(1) — 60, : (2) Lo lig tg 
0 = ’ bo = —a@ AD 


ee 2 2 1 
ne? TPP 
a 
MD | St | 
1 a (2) . 
1 


The derivatives ¢‘”, t{? are positive and nearly equal when the saddle 
points are close together. 
When n = 0, the Ursell equations (61), (62), and (63) become 


Po1) = @10/Bi00 
Poo = Qo0/Booo (148) 


(146) 





(147) 


Qo1 Boo 


— Sor 1 
Bos Boro Bo10 Poo 
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The values of a19, 8100 otbained from the leading terms in the asymp- 
totic series (54) defining the a»’s and B,1m’s give 


Po@:) = gi(h/ny"a”. (149) 


Similarly, comparing the asymptotic series (64) defining the ao;’s and 
Bom’s with the series (109) leads to expressions which give 


(0) s(1)A 
Jo to ) 


(1) (0) 1 ha? (1) (A+1) 
go — A+ 1)go 30,6 TF EZ to : 


The remaining coefficient, Po2, in Po(v) may now be obtained by com- 
bining (149) and (150). The coefficients po, give the leading part of 
the desired expansion (144) for J. 


Poo = (150) 


Poi 
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Gain Control for Diversity Receivers 


By STEPHEN S. RAPPAPORT 
(Manuscript received April 18, 1968) 


Previous work on optimum gain control is extended to an important 
class of diversity receivers used for digital data transmission through fading 
media and for radar. As in the single diversity case the optimum gain 
(which yields minimum average cost of receiver saturation) is extremely 
ensensitive to relative costs of saturation at the upper and lower dynamic 
range bounds. The sensitivity to relative cost decreases as the order of 
diversity increases. 

Optimum gain and performance characteristics are given from which 
dynanuc range requirements for diversity receivers can be deduced. 


I. INTRODUCTION 


A good part of detection theory literature deals with the determina- 
tion of statistically optimum or near optimum receiver structures. 
However, in any practical implementation of these receivers the signal 
processing must be performed by components of finite dynamic range. 
To effectively use the amplitude range of a signal processing chain 
it 1s common to scale the received signal by adjusting the receiver 
gain. Optimum gain settings for minimum average cost of excluding 
(from a receiver’s finite dynamic range) the envelope of a narrowband 
signal plus gaussian noise were presented last year.1 Here similar 
results are presented for an important class of diversity receivers 
used for comunications through fading media and for radar. For 
the single diversity (4 = 1) case, these results reduce to those 
given previously. 


II. PREVIOUS RESULTS 


Consider the problem of determining the normalized attenuation, 
a to optimally scale a positive homogeneous functional, é, of the 
received signal so that the average cost, 1, of excluding é from the 
recelver’s dynamic range is minimized. It follows from Ref. 1 that 


2015 
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the average exclusion cost is given by 


t= [r.@daty [ r.@ a (1 


in which d denotes the dynamic range of the receiver (such that D(dB) = 
20 logio d, d > 1), v is the ratio of cost of saturation at the upper dy- 
namic range bound to the cost of saturation at the lower, w is a vector 
parameter determined by signal noise and channel conditions, and p, 
is the probability density function of & When the optimizing value of a 
is a stationary point of | it can be found as a real positive solution to 


Pula) = v dp,(ad). (2) 
If a, is the optimum a then the minimum average exclusion cost is 
l= P,(a,) + [1 — P.(a, d)] (8) 


in which P, is the cumulative distribution corresponding to p,. For 
vy = 1, 1 becomes the exclusion probability. Ref. 1 considered this prob- 
lem in detail for the case in which £ represents the envelope of a narrow- 
band signal plus gaussian noise received through a Rician fading me- 
dium. The results are based upon the solution of (2) for the case in 
which p, is the Rician’ probability density function defined by 


pr(é) = Eexp[-—E@’ + 7')/A(vt) vw 20 (4) 


where y is a suitably defined signal-to-noise ratio. (yas = 20 logioy). 


III. GAIN SETTINGS FOR DIVERSITY RECEIVERS 


In various diversity receivers formation of the test statistic leads 
to the generalized Rician probability density function given by 


po(R) = R(R(M)*/y)""* exp [—(R? + -¥°/M)/2\-alyR/(M)*]_ (6) 
Rk, y, 20 
M =1,2,-:- 


where Jx denotes the modified Bessel function of the first kind and order 
K and w is the vector (vy, 7). Such is the case for example in square-law 
combining M-fold diversity receivers for noncoherent frequency shift 
keyed signaling through Rician or Rayleigh (if y = 0) fading channels, 
in radar receivers using post detection square law integration of M 
pulses, f and in partially coherent diversity reception of N-ary orthogonal 

+ In these cases the functional é is the test statistic. More generally however, 


the functional used for determining the optimum gain need not be actually 
formed in the receiver. 
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signals transmitted through M independent slow Rician fading chan- 
nels.°"* 

The probability density function (5) has interesting properties. It 
can be shown that (5) is the probability density function of the 
square root of the sum of squares of M independent normalized 
Rician variates, each variate having a probability density function 
of the form (4) with y replaced by y/(M)*. For M = 1, (5) of 
course reduces to the Rician probability density function (4), and 
if in addition y = 0 it becomes the Rayleigh probability density 
function. The density (5) can be viewed as the probability density 
function of the square root of a noncentral chi-square variate with 
2M degrees of freedom and noncentrality parameter y?. With y = 0 
it becomes the density of the square root of a chi-square variate with 
2M degrees of freedom. 

In many practical cases y? is proportional to the ratio of the total 
specular energy received via the M diversity branches to the sum of 
the scatter and noise energy received via any diversity branch (as- 
suming that this latter sum is the same for any diversity branch). 
Thus y?/M can be thought of as the power signal-to-noise ratio per 
diversity branch or per pulse in the case of time diversity if, as is 
commonly assumed, the diversity branches are statistically independ- 
ent but have identical parameters. 

Since (5) arises in various applications, let us consider the canonical 
problem. Specifically the solutions to (2) will be obtained in which p, 
is given by (5). Letting a = A(2)' and a = y/(M)' leads to the following 
transcendental equation for A: 


A® = Al + [(M + 1)/2] Ao + 1/@ — 0) 
-{In In-:[aA d(2)*] — In Iy-1[eA(2)*]} 
(6) 
in which 


2Ind 
At ae (7) 


determines the optimum required attenuation for the Rayleigh case 
with unity cost ratio (v = 1), and 








ava om. 8) 


For the single diversity case (M = 1), (6) reduces to the trans- 
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cendental equation encountered previously. One is thus led to seek 
an iterative solution along the same lines. Before obtaining the 
required iteration equations consider some properties of (6). The 
equation can be written in the form 


A? = Aj + MAg + [1/@ — 1] {In [(@A d(2))"""'I-@A d(2)4)] 


— (M — 1) nd’ — In [(@A(Q)4)""Ty(@AQ))]}. (9) 
Combining the logarithmic terms on the right side of (9) and using 
the fact that In(Z) — (Z*)/2*K! as Z — 0 it is seen that the quan- 
tity in braces goes to zero as y — 0. Hence the optimum attenuation 
for the chi-square case (y = 0) is determined explicitly by A = A,, 
where 


At. = AZ+ MA}. (10) 


In the same manner it is seen that if A?, = 0, then A = 0 is a solution 
to (9) for any y and d. It is easy to show that the right hand side of (9) 
is an even function of A having a minimum of A?, at A = 0. The left 
hand side of (9) is of course a standard parabola centered at the origin. 
These curves (2) do not intersect if A?, < 0, (zz) intersect only at A = 0 
if A?, = 0, and (zz) intersect at positive (and negative) values of A if 
A®, > 0. Thus meaningful values of A which minimize / are stationary 
if and only if A?, > 0. From (7), (8), and (10) this requires vd’” > 1, 
which for M = 1 reduces to the constraint encountered previously.’ 

The solution to (6) or (9) can be obtained using the extrapolated 
iteration scheme described in Ref. 1. The iteration formulas require the 
derivative of the right side of (9) which can be found using the identity 


d/dt (CLAD) = PLAY) n= ++ —2,-1,0,1,2,-55. AD 
The result is 

am (2)! Tn2[A da(2)*] jsdohy 

MA) =3@- DA TaratA da(2)'} ~ Tyafdaayf 


in which f denotes the right hand side of (9) and the prime denotes dif- 
ferentiation with respect to the argument. 
For computational purposes it is convenient to define the functions 


Ve(S) = (exp —9) Ix) (13) 
which are uniformly bounded on the semi-infinite interval [0, oo]. 
For any argument ¢ these functions can be readily generated by 


GAIN CONTROL 2019 


numerical techniques using the recurrence relations and asymptotic 
expansions for the modified Bessel functions. 

Using (12) and (18) the following iteration formulas to solve 
(6) are obtained. 


Ais = F; — (A; — F,) B; =~ —1 
A? + [(M + 1/2] A? 


Ajo(2)! ; W-1(A; da(2)') 
d+1 +e — DI [in War-1(A (2) | 


G. = ae War- lA; da(2)} ] — Peale 
7 2@ — DA; VU Gy[A; daQ2)4] War [Aa(2)?] 


8; = G;/(1 = G;) G; #1 


F, 


I 


+ 
(14) 


The iteration is begun with ¢ = 1, y small, and Aj = A?, and stopped 
when | (A;4,; — A,)/A, | is less than the allowable error. By this method 
the optimum required normalized attenuation was found for various 
values of v, y, d, and MW. 

Inasmuch as the optimum attenuation satisfies the nonlinear equation 
(6) it will be helpful in interpreting results to find some useful approxima- 
tions. Accordingly one notes that for y’/M > 1 and d > 1 the second 
term in the brackets on the right side of (6) is negligible compared with 
the first. Then taking I y;(x) & exp x leads to a quadratic equation in A 
whose solution 





as Y 2 (M +1) 3 
4=emrat ly nae age aa > 
approximates the required attenuation over the range specified. 

For y’/M <1, on the other hand, one may take Iy,(x) & (2/2)"/M! 
in (6). Some further approximations and manipulation lead to the 
result 


AMA. +0°/2M)~ /M<K1 (16) 


which is exact for y = 0. 

When the solution to (6) is found for given parameters the mini- 
mum average exclusion cost can be determined. For the probability 
density function (5), (8) becomes 


1= 1 — Qule, A(2)*] + rQule, Ad(2)}] | (17) 
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where 


Que) = [6/2 exp [-( + 2)/2M alas) de (18) 


is the generalized Marcum Q-function.* 


IV. EFFECT OF COST RATIO ON REQUIRED ATTENUATION 


Since one cannot easily decide how much better or worse it is for 
the receiver to saturate at its upper limit than at its lower limit, the 
ratio, v, is generally difficult to assess accurately. It is interesting (even 
fortunate!) that here as in the single diversity case,1 the solutions 
obtained using (14) show that the optimum receiver attenuations 
for a wide range of cost ratios do not differ appreciably from those 
for the minimum exclusion probability case (v = 1). 

For given v, y, d and M one may define the sensitivity, S,, of the 
optimum attenuation to the cost ratio by the difference in required 
attenuation between the given case and the corresponding minimum 
exclusion probability case. Specifically, the sensitivity to cost ratio is 


S.(v, y, d, M) = 20 logio Ay, y, d, M) 

— 20 logi, A@ = 1, 7, d, M) (19) 
in which the functional dependence is shown explicitly. In (19) a positive 
value of S, indicates an increase in required attenuation compared with 
the minimum exclusion probability case. It can be shown that (2) the 
sign of (19) depends only on »v (positive if vy > 1, negative if » < 1), and, 
(at) | S.(v, 7, d, M) | S | S.(»,0,d, M) |. Thus, one can define a maximum 
sensitivity 

S* & §.(v,0,d, M) = 10 logy, A2, — 10 logy, MA} (20) 
where the second equality follows from (19) and (10). Using (7) 

and (8), (20) can be written 
S* = 10 logy [1 + (an/2MD)] (21) 

in which 

Vas A 20 logio v (22) 
is the cost ratio expressed in dB. S* is an easily calculated bound which 
gives, with the correct algebraic sign, the maximum change (in dB) of the 


optimum required attenuation from the optimum for the minimum ex- 
clusion probability case. Figure 1 is a plot of S*. It follows from (21) 
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Fig. 1— Maximum change in required receiver attenuation caused by nonunity 
cost ratio. 


that for cost ratios in the range —MD S van S 2MD, the maximum 
change in required receiver attenuation is less than about 3 dB. Equiva- 
lently, with y, v, and d fixed, the maximum sensitivity, S* decreases as 


the order of diversity, M, increases. 
Since the optimum attenuation is extremely insensitive to cost 


ratio for typical parameters, the minimum exclusion probability 
case (v =1) is of special import among all average cost criteria of 
the form (1). The numerical results presented in this paper, there- 
fore, include only the case v = 1, although the formulas derived apply 
more generally and can be used to generate numerical results in an 
entirely similar manner. 


V. EFFECT OF DIVERSITY ON REQUIRED ATTENUATION 


It is interesting to consider how the order of diversity affects the 
optimum required attenuation. Accordingly, in a manner analogous 
to (19) one can define the difference in required receiver attenuation 
resulting from diversity by 


Sn(Y, 7’ d, M) = 20 logio A(y, Y d, M) 


— 20 logy AQ, y,d, M = 1). — (2) 
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Let 
A; y, d, M) = 20 logio A(y, y, d, M) (24) 


be the required normalized attenuation in dB. Then (23) can be 
written 


S,(, y, d, M) = A,(, y, d, M) AQ, y, d, 1). (25) 


Values of S,,(1, y, d, M) were obtained for various y, d, and M 
using (14) and (23). These are shown in Fig. 2 where all quantities 
except M are in dB. The optimum normalized attenuation A, (1, y, 
d, M) required for the minimum exclusion probability case can be 
found using (25). Specifically one finds S,,(1, y, d, M) from Fig. 2, 
and adds to it the quantity A,(1, y, d, 1) from Fig. 6 in Ref. 1. 
Notice that in Ref. 1 only the single diversity case (MV = 1) was 
considered so that the functional dependence of A, on M was sup- 
pressed in the notation. That is, A,(v, y, d, 1) here is identical to 
A,(v, y, d) in Ref, 1. 

From Fig. 2 it can be seen that if y is sufficiently small (or large), 
Sm is positive (or negative) so that more (or less) attenuation is 
required if multiple diversity is used than would be required if the 
same specular energy were concentrated in a single diversity branch 
or pulse. Also for sufficiently small (or large) y the required attenua- 
tion increases (or decreases) as the order of diversity, M, increases. 
There is of course a transition region which bridges the above cases 
and in which, for y fixed, the differences S,, cross one another depend- 
ing on the particular values of M, and D (and, in the general case, 
v). The curves for yan =. 15, for example, exhibit this behavior. 

Using (10) and (16) it can be shown that for y — 0 


S,v, 7, d@, M) © 10 logi, M 
1+ bn/2MD) | E + eH) | 
10 1 y) 
TF Gax/2D) JF OE LTE Gea) | OO 
which is exact for y = 0. For the minimum exclusion probability 


case and y = 0 (26) yields S,, (1, 0, d, M) = 10 logio M which is in- 
dependent of d. Similarly it can be shown using (15) that 


+ 10 logio | 


lim S,,(”, y, d, M) = —10 log, M (27) 
y>0 
which is independent of v and d. The differences S,, for yaa = +0 
therefore appear as horizontal lines in Fig. 2. One also observes that 
over the range of parameters shown, the limit (27) is approached within 


7, M=1) 


\ 


', 


1, ¥,0,M) - Ap (v 
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FLEE A 
\ 
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Fig. 2— Differences in required optimum attenuation 
resulting from diversity. y = 1 
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0.5 dB for yas = 30. Noting that for y° < y < y*, S,, is bounded by 
S,r(v, a; d, M) < Srl, Y d, M) < Sn(v, ve d, M) (28) 


it follows from (26) through (28) that for the minimum exclusion proba- 
bility case all the differences S,,(1, y, d, M) lie between two horizontal 
lines in Fig. 2 determined only by the order of diversity. | S,.(1,7,d,M) |S 
10 log,. M. The difference between the required optimum attenuation 
for dual diversity and that required for single diversity with the same 
total received specular energy is less than about 3 dB. 


VI. EXCLUSION COSTS FOR DIVERSITY RECEIVERS 


The optimum normalized attenuations obtained using the iteration 
equations (14) were used to obtain the minimum average ex- 
clusion costs (17) for the case y = 1. These are shown in Fig. 3(a) 
and for smaller values of D in Fig. 3(b). The generalized Q function 
(18) was evaluated by computer, using relations derived from those 
given by Sagon.® 

It can be seen that for small values of y, the smallest dynamic range 
D required to obtain a given exclusion probability decreases rapidly 
as the order of diversity is increased; the most substantial decrease is 
obtained in going from single to dual diversity. This trend is lessened 
as the available signal-to-noise ratio y increases. As a matter of fact 
if y is sufficiently large (for example, y = 20 dB) the dynamic range 
required to achieve a given exclusion probability increases as M 
increases. However at the large values of y where this latter effect is 
apparent, even modest values of D yield extremely small exclusion 
probabilities. Moreover on the types of channels where diversity re- 
ceivers are useful one would generally encounter small values of y. 

Consider a diversity receiver operating in a small signal-to-noise 
ratio and let the dynamic range of the components used be such that 
the probability of excluding the signal at any point in the receiver is 
the same throughout. Then it follows from Fig. 3 and the foregoing 
discussion that the dynamic range required of the components used 
in the post-combining portions of the receiver may be considerably 
smaller than that required of those components used in the individual 
diversity branches. 


VII. SUMMARY AND CONCLUSIONS 


An important class of diversity receivers used for communications 
through fading media and for radar is considered. The required gain 
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Fig. 3— Minimum exclusion probabilities for diversity receivers, 
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is determined which minimizes the average cost of excluding from a 
finite dynamic range the signal appearing in the post-combining por- 
tions of the receiver. For the single diversity case (MM =. 1) the results 
reduce to those given previously. 

It is shown that the required receiver gain is extremely insensitive 
to the relative costs of saturation at the upper and lower dynamic range 
bounds, differing at most by about 3 dB from the optimum for the (equal 
cost) minimum exclusion probability case for relative costs in the range 
—MD S vaz S 2MD. One also finds that the sensitivity to relative cost 
therefore decreases as the order of diversity increases. 

The difference between the required optimum receiver gain for 
various orders of diversity M, and that required for a single diversity 
receiver having the same total received specular energy is considered. 
Exact differences are given for the minimum exclusion probability 
case, and it is shown that these are less than 10 logi9M dB independent 
of other parameters. Bounds on the difference are also given for non- 
unity cost ratio. 

Performance characteristics derived show minimum exclusion prob- 
abilities obtainable as a function of dynamic range for various signal- 
to-noise ratios and orders of diversity. For a small signal-to-noise 
ratio the dynamic range required of the components used in the post- 
combining portions of the receiver can be considerably smaller than 
that required of those components in the individual diversity branches 
in order to achieve uniform exclusion probability throughout. 

Notice that in some applications the normalization assumed in 
writing (4) and (5) may depend upon M and y. This fact must be 
accounted for if one is calculating the actual required attenuation 
from the required normalized attenuation discussed in Sections IV 
and V. The optimum exclusion costs however, depend on the normal- 
ized attenuation and not on the normalizing factor. The results of 
Section VI therefore apply directly. 


VIII. ACKNOWLEDGEMENT 
I offer my thanks to A. Anastasio, who helped program the com- 
puter to evaluate the generalized Q-function. 


REFEREN CES 


1. Rappaport, 8. S., “Communications and Radar Receiver Gains for Minimum 
Average Cost of Excluding Randomly. Fluctuating Signals in Random 
Noise,” B.S.TJ., 46, No. 8 (October 1967), pp. 1753-1773. 

2. Rice, 8. O., “Mathematical Analysis of Random Noise,’ in Selected Papers 


GAIN CONTROL 2027 


on Notse and Stochastic Processes, ed. by N. Wax, New York: Dover 
Press, 1954, p. 239. 

3. Helstrom, C. W., Statistical Theory of Signal Detection, New York: Perga- 
mon Press, 1960, pp. 171-174. 

4, Lindsey, W. C., “Error Probabilities for Partially Coherent Diversity Recep- 
tion,” IEEE Trans. Comm. Theory., COM-14, No. 5 (October 1966), 
pp. 620-625. 

5. Sagon, H., “Numerical Calculation of the Incomplete Toronto Function,” 
Proc. IEEE, 54; No. 8 (August 1966), p. 1095. 


Multimoding and its Suppression 
in Twisted Ring Counters 


By W. BLEICKARDT 
(Manuscript received March 21, 1968) 


Many digital systems, such as PCM systems, data processing and data 
transmission systems, use twisted ring counters. Most of these twisted ring 
counters are subject to multimoding. This paper develops tools and methods 
for predicting all possible modes in twisted ring counters, and derives a 
general solution for suppressing the wrong modes. Suppression ts ac- 
complished by adding a few circuit connections from the output of certain 
stages to the input of another stage. The paper derives the number of neces- 
sary connection lines and their connection points for the various types of 
counters. 


I. INTRODUCTION 


Twisted ring counters of various types have been used for many 
years, and have been described in many publications.~> They are 
designed for creating a well-defined periodic pulse pattern. But they 
all have one problem in common: under certain circumstances they 
can multimode, that is, they can create undesired patterns. Hach 
mode of a counter creates a particular pattern. Only one of these 
modes is the desired one, the “correct mode;” the rest are all “wrong 
modes’? and must be suppressed. To the knowledge of the author, 
none of the publications on twisted ring counters presents a rigorous 
treatment of the problem of multimoding, although it must have 
shown up in many instances and often was solved empirically.> The 
lack of a general theory on possible modes in twisted ring counters 
and on the prevention of undesired modes led to this investigation. 

Terminology for the characterization of modes, and relations be- 
tween the parameters, make it easy to find the entire set of possible 
modes for any twisted ring counter. There is a method for suppressing 
all wrong modes by adding a few circuit connections, and a general 
formula that indicates these additional connections for any individual 
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ring counter. The method for suppressing all wrong modes in any 
twisted ring counter is summarized in Section 5.5. 


II. OPERATION OF TWISTED RING COUNTERS 


A twisted ring counter consists of a shift register whose output is 
fed back over a twist to its input in a ringlike manner (Figs. 1, 2, 
and 3). An input clock keeps a certain pattern circulating around the 
ring. In the correct mode the stages create the desired pattern by 
switching on sequentially with subsequent clock pulses, and then 
switching off in the same sequence (part a of Figs. 1, 2, and 3).* With 
each clock pulse only one stage is switching. A counter with n stages 
creates a periodic pattern with a period of 2n time slots as shown in 
the first three figures. Some possible implementations of counter stages 
are shown in Fig. 4, using AND gates, NAND gates and set-reset flip- 
flops. Equivalent stages can be built by using or gates and nor gates, 
or any custom-designed circuit. 

There are two general types of twisted ring counters: single-phase 
counters with one input clock line (example in Fig. 1), and double- 
phase counters with two input clock lines supplying interleaved pulses 
(examples in Figs. 2 and 3). Many of the single-phase counter stages, 
such as the ones shown in Fig. 4a and b, require short input clock 
pulses to prevent racing. The clock pulses must be shorter than the 
propagation delay of one stage. An example of a stage that does not 
require short clock pulses is shown in Fig. 4c.* Double-phase counters 
permit the use of simple gated set-rerest flip-flop stages (Fig. 4d and e) 
without the restriction of short clock pulses. Notice that in counters 
with an even number of stages (Fig. 2) the two clock phases are dis- 
tributed in a different way from those in counters with an odd num- 
ber of stages (Fig. 3). 

The problem of multimoding arises whenever more than one mode 
can exist. In that case, errors can switch the counter to other (wrong) 
modes with undesired patterns. Such errors can be created by noise 
transients, aging components, marginal design, and so on. The first 
three figures show some examples of wrong modes. In general, the 
number of wrong modes possible increases with the number of stages 
of a counter, and is higher for single-phase counters than for double- 
phase counters. To design reliable circuits, one must prevent un- 


* The numbers in parentheses in Figs. 1, 2, and 3 are a symbolic notation for 
different modes; they indicate the numbers of time slots a particular counter 
stage remains in one state. This notation is explained in Section III. 
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Fig. 1 — Single-phase twisted ring counter with five stages. 
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(b) 





Fig. 2— Double-phase twisted ring counter with even number of stages (six 
stages). 


desired patterns from circulating for more than a very short time 
(typically less than one counter period). 


III. GENERAL CHARACTERIZATION OF MODES 


There is a unique way in which a pattern, that is, a sequence of 
states 0 or 1, is circulated around the counter ring. Any pattern is 
shifted by one stage per time slot, as can be seen from the pulse dia- 
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Fig. 3— Double-phase twisted ring counter with odd number of stages (five 
stages). 
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Co«x-— SHORT PULSES REQUIRED 
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Co=<-[° TRIGGERS ON EDGE 





Fig. 4— Implementation of counter stages with AND gates, NAND gates, and 
set-reset flip-flops; a, b, and c for single-phase counters, d and e for double- 
phase counters. 


grams in Figs. 1, 2 and 3. The state of the last stage appears in in- 
verted form at the first stage in the subsequent time slot. For a 
counter with n stages, the pattern, seen as a time sequence at each 
stage, repeats itself in inverted form after n time slots; the whole 
counter period is 2n time slots long. 

This well-defined behavior allows us to reconstruct the entire pulse 
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diagram for a particular mode, if we only know the states of all n 
stages at any one time, or if we know a sequence of n states at any 
single stage. Therefore, a sequence of n binary digits uniquely de- 
scribes a mode. 


3.1 Definitions 

(7) We will call the state of a particular stage in a particular time 
slot an element. An element can have a state 0 or 1. 

(72) Elements in successive time slots, or in successive stages that 
have the same state, form a logic group. 

(2) The size of a logic group (g;) is the number of its elements. 

(tv) The smallest size logic group of a particular mode has gin 
elements. 

(v) The positive direction of a sequence of elements corresponds 
to the sequence as observed on the positive time axis. This corresponds 
to a sequence backwards through the stages. (This can be illustrated 
with Fig. 1b. The sequence 1 1 1 0 1 appears at stage S; in the time 
slot sequence t,, te, ts, ts, ts, and it appears at time ts in the stage 
sequence Ss, Sz, S3, S2, S1.) 


3.2 Description 

For describing one particular mode, it is sufficient to write the size 
and sequence of the logic groups g; that are built by n elements. The 
following symbolic notation is used: 


(1 + 92 +s + +++ +92) 


where 


> 9; = n = number of stages 


zx = odd number. 


For example, (3 + 1 + 1) denotes a mode of a 5-stage counter, with 
three logic groups, the first containing three elements, the second and 
third containing one element each (shown in Fig. 1b). 

This symbolic notation describes one half of the periodic cycle. 
Since each half is always the complement of the other half, the ele- 
ments of the first and the last logic group in the mode notation have 
the same state. Therefore, the number x of logic groups in this nota- 
tion is always an odd number. This is illustrated with a 7-stage coun- 
ter, for which a time sequence of states, as observed on the oscillo- 


2036 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 
scope connected to one of the stages, may look like this: 


counter period = 14 states 
$< ___________—_—» 
first half | second half 
<_—_—__—_> | <__—_—__> 
time sequence: - -O0O11/00001001111011/000- - 


logic groups: 4 1 2 4 1 2 
(Ap aD) 
mode notations: (hota 4) 
(2+4-+ 1) 
(Ate 19), 


The period consists of 2n = 14 states. Describing this particular mode, 
the logic groups built by n = 7 elements can be written in three differ- 
ent ways: (4+ 1+ 2), (1+24+ 4), and (24+ 4+ 1). 

These mode notations are cyclic permutations. Hence they are 
equivalent and describe the same mode. The 7-stage counter could 
have another mode with the same set of logic groups. This different 
mode can be described by the following three equivalent mode nota- 
tions: (4+2+1),(2+1+4 4), and (1+ 4+ 2). If a certain wrong 
mode can exist, all possible permutations can exist also. 


The correct mode always is the one with x = 1, that is, with one single 
logic group of size n. All other possible modes with = 3 are wrong 
modes. 


IV. PREDICTION OF POSSIBLE MODES 


4.1 Possible Logic Groups 


Not all possible partitions of n into an odd number = of logic croups 
result in a possible mode, because there are some restrictions in pos- 
sible logic group sizes g; for the different counter types. 

In single-phase counters, the logic groups can have any even or 
odd number of elements, up to n, since in any time slot, either a “1” 
or a “0” can be shifted from any stage to the following stage (Fig. 1). 
This is not so in double-phase counters. 

In double-phase counters with an even number of stages (Fig. 2), 
a clock pulse A can shift either a “1” or a “OQ” to any odd-numbered 
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stage from the preceding stage, and a clock pulse B can shift either a 
“1” or a “0” to any even-numbered stage from the preceding stage. 
This results in the restriction that only logic groups with an even num- 
ber of elements can appear in a possible mode. 

In double-phase counters with an odd number of stages (Fig. 3), a 
clock pulse A can shift a “1” to any odd-numbered stage and a “0” 
to any even-numbered stage, and a clock pulse B can shift a “0” to 
any odd-numbered stage and a “1” to any even-numbered stage, 
always from the preceding stage. This results in the restriction that 
only logic groups with an odd number of elements can appear in a 
possible mode. 


4.2 Examples of Possible Modes 

We are now able to predict all possible modes of a twisted ring 
counter with n stages by breaking n into an odd number of logic 
groups in all possible ways, taking the restrictions of possible logic 
group sizes into account. This is shown in three examples. 

Example 1: A single-phase counter with n = 6 stages can have six 
different possible modes: 


(6) correct mode 
4+1+1) 
(3:2 2'-P 1) 
(8 + 1+ 2) wrong modes. 
(Qi 22) 


QP isp dq t sp 1) 


In this counter type, the logic groups can have an even or odd number 
of elements. 

Example 2: A double-phase counter with an even number of n = 
6 stages (Fig. 2) has only two possible modes: 


(6) correct mode 
(2 + 2 + 2) wrong mode. 


In this counter type, the logic groups can only have an even number 
of elements. Because of this restriction, there are always fewer wrong 
modes than in a single-phase counter with the same number of stages. 

Example 3: A double-phase counter with an odd number of n = 9 
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stages has ten different possible modes: 
(9) correct mode 
(Cad 1) 
(5+3-+4 1) 
(5 + 1+ 3) 
Ob leb dob 1 I) 
(8 + 3 + 8) wrong modes. 
8+3+1+1+1) 
B+1+3+14+)) 
Capi Dk Pee be bpd) 
eer beep iP dae ee ae 


In this counter type, the logic groups can only have an odd number 


of elements. In general, the higher the number n of stages, the higher 
is the number of wrong modes. 


4.3 Experimental Verification of Predicted Modes 

Many counters of the three types shown in Iigs. 1, 2, and 3 have 
been built, with various numbers of stages, and with different types of 
stages, including all types shown in Fig. 4. All of the predicted modes 
for these counters have actually been observed. Any desired mode can 
be induced by presetting all stages before turning the clock pulses 
on, but only the possible modes will be able to circulate without being 
altered. 


V. SUPPRESSION OF WRONG MODES 


All wrong modes can be suppressed by adding a certain small num- 
ber of circuit connections. A general method for finding the necessary 
and sufficient additional connections for any twisted ring counter is 
to find criteria that are common to all wrong modes but do not ap- 
pear in the correct mode. By suppressing these criteria, all wrong 
modes will be prevented. To find these common criteria, it is useful 
to define the concept of common logic groups. 


5.1 Common Logic Groups 


For a particular counter, the common logic groups represent the 
set consisting of the smallest logic groups (gmin) from each wrong 
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mode. For example, the 9-stage double-phase counter, whose wrong 
modes are listed in example 3 of Section 4.2, has two common logic 
groups of sizes 1 and 3. Hach wrong mode contains at least one of the 
common logic groups. Taking the gmin-values of all wrong modes as 
common logic groups results in the smallest possible set of logic 
groups with the property of each wrong mode containing at least one 
of these logic groups. 

The size of the smallest common logic group (m,;) is equal to the 
smallest Qinin-value of all wrong modes, gmin min. That is 


Jmin min = 1 for single-phase counters, 
= 1 for double-phase counters with odd number of stages, and 
= 2 for double-phase counters with even number of stages. 


Jmin min 

Jmin min 

The size of the largest common logic group (mo) is equal to the 
largest gmin-value of all wrong modes, that is, gmin maz ! 


z 
Mo = Omin maz of > 9; =n 


f=1 


with « = 3 for wrong modes. Every possible partition of the above sum 
represents a possible wrong mode with a certain value g,,;,. The maxi- 
mum of this value for all possible partitions is min mar - 1b occurs with 
the minimum value of x = 3 and is 


min max < n/3. 
The largest common logic group is therefore 


M S n/3, (1) 

the next possible logic group size equal or less than n/3. This is 
Mm. 2 (n — 2)/3 for single-phase counters, (2) 
M, = (n — 4)/3 for double-phase counters. (3) 


This results is only a single p-value in each case, when the restric- 
tions of possible logic group sizes are taken into account. Combining 
the latter and expressions (1), (2), and (3) into a single expression, 
we get for the largest common logic group mo: 


n 2 
My = BA 0O<AK<1 (4) 
with A chosen to make mp an integer, and 


m = number of stages 
p =1 for single-phase counters 
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D 2 for double-phase counters 
My, = 1, 2, 3,4, °°: for single-phase counters 
mM, = 2,4, 6, 8, --- if m = even } for double-phase counters. 
m = 1,3, 5,7, --: ifn = odd 


The set of common logic groups for a particular counter consists 
of the smallest and the largest common logic groups and all possible 
sizes of logic groups between. It is given in Table I for counters up 
to 20 stages. For single-phase counters with two stages and for double- 
phase counters with two or four stages there are no common logic 
groups, since these counters do not have any wrong mode. 


5.2 Suppressing the Common Logic Groups 


Suppressing all common logic groups in a counter leads, by defini- 
tion, to the prevention of all possible wrong modes, and does not in- 
troduce any new modes. This section shows that there is a subset of 
common logic groups (Table II) whose suppression is sufficient for 


TaBLE I— Common Logic Groups 
(Common logic groups are all different gmin values of all wrong modes) 


For double-phase counters 


Number of For single-phase |S 
stages counters With even With odd 
number of number of 
n stages stages 
9 oat a 
3 1 1 
4 1 —_ 
5 1 1 
6 21 2 
7 21 1 
8 21 2 
9 321 31 
10 321 2 
11 321 31 
12 4321 42 
13 4321 31 
14 4321 42 
15 54321 631 
16 54321 42 
17 54321 531 
18 654321 642 
19 654321 53] 
20 654321 642 
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suppressing all common logic groups and is thereby sufficient for 
preventing all wrong modes. 


5.2.1 Method of Suppressing a Group 

If we want to suppress a particular common logic group of size mj, 
we must prevent one of the following two patterns consisting of an 
undesired sequence of ones and zeros 


---001111111100--:- 

or (--LIODODODOOOO0OOI1I-::-:) 
Re 
m, elements 


from circulating around the counter ring. The inverse pattern, in paren- 
theses, always appears with the first one. This suppression can be 
accomplished by preventing stage S, from switching from “0” (‘17’) 
to “1” (“0”) whenever stage S,-1-, is in state ‘0” (‘1’). The position 
of the patterns immediately before suppression is: 


Susgeaei S21 roe 


i vat 
~-0O011LT1L1111100--- 


or (--L1LO0OO0OO00000011-:-:-). 
_—_——_——ooe > - - 
m,; elements 


If stage S, does not switch to “1” (“0”) with the next clock pulse, the 
logic group of size m; is prevented from passing through stage S,. It 
is sufficient to suppress only one of the two patterns, since the in- 
verse of itis then suppressed automatically. 


This suppression can be implemented by adding a circuit connection 
from the output of stage S,-,-m, to the input of stage S, , preventing 
S, from switching from “0” to ‘1 whenever S,-:~n,; 18 in state “0.” 
This circuit connection, shown in Fig. 5a, bridges m; stages, and there- 
fore is called a “‘bridging connection’; its associated parameter m; is 
called a “‘bridging parameter.”’ 

The bridging connection could also be made on the inverse side of 
the stages S,-,_,,, and S,, thus preventing S, from switching from 
“1” to “0” whenever S,_1-,,; 1S in state “1.’’ These two bridging con- 
nections are equivalent, and one of them is sufficient. However, if both 
connections are applied for each m,-value, a wrong mode is cleared 
within half a counter period instead of a full period. 
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BRIDGING CONNECTION 








my STAGES 


(a) 


GATED SET-RESET 
FLIP-FLOP STAGES: BRIDGING CONNECTIONS 


FOUR-GATE STAGES: 
(WITH NAND GATES) 


SIX-GATE STAGES: 
(WITH NAND GATES) 


* INPUTS FOR ADDITIONAL (d) 
BRIDGING CONNECTIONS 


Fig. 5— Suppression of wrong modes by adding bridging connections, bridg- 
ing m: stages. Part a shows the principle; b, c, and d show an example with 
the two bridging connections m = 3 and m = 1 for counters with different 
types of stages. 


S, may be any particular stage of the counter, but it should be the 
same stage for all bridging connections (although this is not essential 
with many counters). The correct mode is not affected by this inhibition, 
since in the correct mode S,-;-m, 18 always in state ‘'1”’ (“0”) when 8S, 
is switched from ‘‘0” (‘‘1’’) to ‘1” (‘0’) because m; is always smaller 
than n. 
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5.2.2 Automatically Eliminated Wrong Modes 


If we suppress one common logic group of size m, by using the 
described method, we prevent the pattern 


mM; 


and hence eliminate not only the wrong modes containing the com- 
mon logic group m,;, but also all other wrong modes that show this 
pattern at any one position. 

The remaining wrong modes, which do not contain this pattern, 
require additional steps for their prevention. It can be shown that if 
a mode with gmin < m, satisfies both of the following two conditions, 
it does not contain the above pattern and therefore is not eliminated 
by suppressing ™m;: 


(z) All possible sums of the elements of an even number 2v of con- 
secutive logic groups must be Sm; for at least one value of v (v = lor 
2or3 --:). 


(zz) All possible sums of the elements of an odd number 2v + 1 of 
consecutive logic groups must be =m, -++ 1 for the same value of v that 
satisfies condition 7. 


Example: Suppose we have a single-phase counter with 19 stages, 
and we suppress the common logic group of size m = 6 by adding 
a bridging connection bridging 6 stages as shown in Fig. 5a. Would 
the mode (2+8+3+1+5+1-+ 4) be suppressed? 

We check whether this mode satisfies both conditions. Condition 7 
is satisfied with v = 1, since all pairs of consecutive numbers in the mode 
notation (2 + 3,3 +3,3+1,14+5,5+1,1+ 4,4 + 2) sum up 
to $6. That is, all sums of the elements of a pair (2v) of consecutive logic 
groups are Sm; . Condition z could not be satisfied with v > 1 in this 
example. Condition 77 is also satisfied with v = 1, since all triplets of 
consecutive numbers in the mode notation (2 + 3+ 3,3 +341, 
34+1+4+5,1+54+1,5+14+4,1+4+2,4 +2 + 8) sum up to 
= 7. That is, all sums of the elements of a triplet (20 + 1) of consecutive 
logic groups are =m; + 1. 

The above mode satisfies both conditions, and therefore would 
not be eliminated by suppression of the common logic group of size 
m= 6. 
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5.2.3 Sufficient Subset 


Suppression of a particular common logic group of size m; generally 
does not prevent wrong modes with gnin > m:, but it does prevent 
some of the wrong modes with gmin < m;. In the remaining unsup- 
pressed modes with gmin < m;, which all satisfy the two conditions 
stated in Section 5.2.2, the largest possible gmin-value, called gmin maz; 
follows from condition 1: 


28 
D9: Sm; (v = 1lor2or3--:-). 
7=1 


Every possible partition of this sum delivers a value gm. The maxi- 
mum of these gmin-values for all possible partitions is gmin maz. It oc- 
curs with the minimum value of v = 1 and is 


Jmin maz Ss m,/2. 


This is the next lower common logic group size m+, that must 
be suppressed: 


Misi S m;/2. (5) 


M+, is the next possible logic group size equal to or less than m,/2, 
which is 


Mis, 2 (m; — 1)/2 for single-phase counters, (6) 


Misr 2 (m; — 3)/2 for double-phase counters. (7) 


This results in only a single m;41-value in each case, when the re- 
strictions of possible logic group sizes are taken into account. 

Combining the restrictions and the inequalities (5), (6), and (7) 
into a single expression, we get for the next lower common logic 
group m,; 4 that must be suppressed: 


m; 
2 





—@p-— 7A OSA 


IIA 
ay 


(8) 


Nua = 


with A chosen to make m,,, an integer, and 


p=1 for single-phase counters 
p=2 for double-phase counters 
Ming =-1, 2, 3, 4, for single-phase counters 
Mia1 = 2, 4, 6, 8, +++ if m = even 
ie iL o er ei ay Seda for double-phase counters. 
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If we suppress m,; , it is sufficient to supress m;,, as the next lower 
common logic group, since suppression of m; prevents all wrong modes 
with m; 2 Ymin > Mi+1. Recursion formula (8) determines the maxi- 
mum spacing of successive common logic group sizes m; to be suppressed 
for sufficiently suppressing all common logic groups within the covered 
range. By extending this range from the largest common logic group 
m to the smallest common logic group m, , we get the sufficient subset 
of common logic groups 


Mo» My, Me, °°"? > M 


that must be suppressed for preventing all wrong modes. mp is de- 
termined by expression (4); m, through m, are obtained by expres- 
sion (8). 


5.2.4 Necessary Subset 


The m,-values resulting from expressions (4) and (8) 
Mo» My, M2, °° Mey rr? y My 


always represent a sufficient subset of common logic groups to be sup- 
pressed for preventing all wrong modes. But for some particular counters, 
the necessary subset 1) , m,, M2, °°* ,m; may be smaller by a few m;- 
values. That is, the smallest values m,., +--+ m, of the set are not neces- 
sary. There is not a simple expression like (4) and (8) for giving only 
the necessary m;-values but, for a particular counter, they may be 
found by using the two conditions in Section 5.2.2, which have not yet 
been used to their full extent in Section 5.2.3. In a first step, the last 
value m, is left off and a check is made whether any wrong mode exists 
that could satisfy both conditions for the remaining m,-values. Such 
modes can be found by listing all possible combinations of logic groups 
that satisfy those two conditions (for 1 S v S m,/2). If there is no mode 
consisting entirely of these listed combinations, m, is not necessary. In 
the next step, m,-; is left off, repeating the procedure, until the last 
necessary value m, is found. 

For counters up to 20 stages, Table II gives the sufficient m,-values 
(bridging parameters) according to expressions (4) and (8), with 
the unnecessary ones in parentheses. 


5.3 Implementation in Different Counter Circuits 


Each bridging parameter m; denotes one bridging connection, bridg- 
ing m, stages, which has to be added to prevent wrong modes (as 
described in Section 5.2.1 and shown in Fig. 5a). Figures 5b, c, and d 


2046 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 


TABLE I] — BRIDGING PARAMETERS ™; 


(Numbers without parentheses denote the necessary and sufficient bridging 
connections.) 


For double-phase counters 








Number of For single-phase —————_ 
stages counters With even With odd 
number of number of 
stages stages 

n mi mi mi 

2 * * 

3 1 1 

4 1 # 

5 1 I 

6 2 (1) 2 

7 2 1 1 

8 2 (1) 2 

9 3 (1) 3 (1) 
10 3641 2 

11 3 (1) 3 (1) 
12 AD: C1) 4 (2) 

13 42 1 3 (1) 
14 4 2 (1) 4 2 

15 5 2 (1) 5 (1) 
16 5 2 (1) 4 (2) 

17 5 2 (1) 5 (1) 
18 6 3 (1) 6 (2) 

19 6 3 I 5 (1) 
20 6 3 (1) 6 2 


* No bridging parameters because these counters have no wrong modes. 


show the bridging connections for the values m = 3 and m = 1 for 
counters with different types of stages. In counters with 6-gate stages, 
as shown in Fig. 5d, additional gates are required for proper sup- 
pression of common logic groups without impairment of the correct 
mode. For not impairing the correct mode, a feedback connection is 
required from the output of stage S,. These counters need one addi- 
tional gate if there is one bridging connection or two additional gates 
if there is more than one bridging connection. 

As an example, we obtain for a 3-stage single-phase counter only 
one bridging parameter m) = m; = 1. This means that only one 
bridging connection is needed, bridging one stage. Figure 6 shows 
three possible locations of the bridging connection. If bridging con- 
nections pass the twist, they must also be twisted, as illustrated in 
Figs. 6b and c. 

For double-phase counters with an odd number of stages, one also 
has to make sure that the signal from stage S,_;-., does not reach stage 
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Fig. 6— Single-phase counter with three stages requiring one bridging con- 
nection, m = 1. Parts a, b, and ¢ are three equivalent solutions. If a bridging 
connection passes the twist, as in b and ¢c, it must also be twisted. 


S, earlier than the signal from stage S,-, caused by the same clock pulse. 
Otherwise a pattern --- 1 +1-+1-+1+1--- might not be prevented 
under certain worst case propagation delays of the logic circuits in- 
volved. It is easy to assure this timing condition if logic gates are used 
that also provide a complementary output (as is the case in emitter- 
coupled gates). Figure 7 shows such an example with Nor/or gates. In 
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A 
© 
© 
B 





Fig. 7— Double-phase counter with three stages (with Nor/or gates). Use 
of complementary gate outputs (or output Sw instead of Nor output S:, and 
Sx instead of Sz) for increasing the permissible gate propagation delay tolerance 
range. 


the case of a wrong mode (1 + 1 + 1) in this double-phase counter 
with three stages, output S,. appears one gate propagation delay later 
than S, upon an input pulse A, and output S2» one propagation delay 
earlier than S, upon the same input pulse A. This is sufficient to meet 
the above timing condition. If complementary gate-outputs are not 
available, a small delay may be introduced into the bridging connections. 
This additional timing condition does not exist in single-phase counters 
and in double-phase counters with an even number of stages. 


5.4 Experimental Verification 


Proper suppression of all wrong modes by bridging connections 
determined according to the described procedures has been verified 
experimentally with counters of all three types (Figs. 1, 2, and 3), 
with different stages (Fig. 4) and with many different values of n. 
Counters for which the necessary set of bridging connections is 
smaller than the sufficient set resulting from the formulas were 
given special attention. 


5.5 Summary: Suppression of Wrong Modes 


A small number of additional circuit connections (bridging con- 
nections) are sufficient for suppressing all wrong modes in a twisted 
ring counter. The bridging connections are determined by the bridg- 
ing parameters m;, which can be found by the formula: 

n 2 
Mo = 3 = 3A 
Mm; 
Misi = Dy ae (p — 3) °A 
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with A chosen to make mp» and m;,, integers, and 


nm = number of counter stages 

p=l1 for single-phase counters 

p=2 for double-phase counters 
m; = I, 2, 3, 4, for single-phase counters 
MDE Oey ee a for double-phase counters 
m,;, = 1, 3,5, 7, --- ifm = odd 

z= 0,1, 2, 3, fe ase oe 


Each of the resulting bridging parameters m; denotes one bridging 
connection in the circuit, which bridges m; stages (Fig. 5). The 
bridging connection can be located anywhere in the counter ring; if 
it passes the twist, it must also be twisted. See Fig. 6. 

The resulting k + 1 bridging parameters denote a sufficient set of 
k + 1 bridging connections in every case. For certain counters, how- 
ever, the necessary set of 7 + 1 bridging connections is slightly 
smaller; it can be determined by the procedure described in Section 
5.2.4. 

Table II gives the k + 1 bridging parameters according to the 
above formula for different counter types up to 20 stages. The bridg- 
ing parameters denoting unnccessary bridging connections according 
to the above procedure are in parentheses. 


VI. CONCLUSION 


Tools and methods for predicting and suppressing wrong modes 
in twisted ring counters have been developed. As a result we have 
gained a better insight into the multimoding mechanism and ob- 
tained a simple method for preventing multimoding. This method is 
summarized, and the required additional circuit connections are given 
in Table II for counters up to 20 stages. 


VII. ACKNOWLEDGMENTS 


I wish to thank Mr. D. Koehler for many helpful suggestions and 
his critical reading of the manuscript, and Mr. W. J. Kusner for 
experimentally verifying various results of this study. 


REFERENCES 


1. Ware, W. H., “The Logical Principles of a New Kind of Binary Counter,” 
Proc. IRE, 41, No. 10 (October 1953), pp. 1429-1437. 


2050 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 


2. Tarczy-Hornoch, Z., “Five-Binary Counting Technique Makes Faster Deci- 
mal-Counting Units,” Elec. Design, 9, No. 2 (January 1961), pp. 34-87. 

3. Millman, J., and Taub, H., Pulse, Digital, and Switching Waveforms, New 
York: McGraw-Hill Book Co., Inc., 1965, p. 682. 

4. Boag, T. R., “Practical Applications of an IC Counter/Shift Register,” 
Elec. Design News, 10, No.7 (June 1965), pp. 18-33. 

5. “TTL Integrated Circuits: Counters and Shift Registers,’ Texas Instruments, 
Inc., Application Report SC 10660 (March 1968), pp. 28-29. 


Synthesis of Rational Transfer Function 
Approximations Using a Tapped 
Distributed RC Line With Feedback 


By DAVID A. SPAULDING 
(Manuscript received June 26, 1968) 


This paper describes a simple procedure for synthesizing an active 
distributed RC network which, by using dominant poles and zeros, realizes 
a very accurate approximation of an arbitrary stable rational transfer 
function. The network uses a single untformly distributed RC line with taps 
spaced along its length. A linear combination of tap voltages is added to the 
input signal to form the driving voltage for the RC line; the output signal is 
also a linear combination of the tap voltages. 

The network offers a number of significant advantages. Since it realizes a 
nearly rational transfer function, the approximation problem can be 
conveniently solved using readily available results on rational function 
approximation. Also, the network uses only one uniform RC line, the transfer 
function can be changed simply by changing resistor values, and the frequency 
can be scaled by minor connection changes. Thus one standard network with 
minor modification ts useful for a wide variety of applications. 

This paper develops the design procedure and derives the various sensi- 
tivity functions of importance. Two example designs are carried out: an 
approximation to a second-order low-pass transfer function and an approxi- 
mation to a second-order band-pass transfer function with a Q of 100. The 
sensitivities for the examples are very reasonable and the measurements made 
on laboratory models indicate excellent agreement with theoretical predictions. 


I. INTRODUCTION 


The progress being made in miniaturizing electronic circuits has 
stimulated a continuing interest in the synthesis of networks using 
distributed RC components. Numerous techniques are available for 
synthesizing transfer functions using distributed RC components in 
conjunction with various active network elements.t Generally, these 
synthesis procedures are applicable only if the transfer function has 
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a very special form. This form is not a rational function of the com- 
plex frequency variable s but involves hyperbolic functions of s. If 
the problem posed to the network designer were merely to realize 
given transfer functions of this special form using distributed RC 
networks, there would be no difficulty. 

However, the problem is generally not this but rather to realize a 
network which achieves certain system specifications such as band- 
limiting or pulse shaping. Thus, a realizable transfer function must 
be developed which approximates the specifications (that is, the ap- 
proximation problem must be solved) before a network can be 
synthesized. Because the transfer functions realizable by distributed 
RC networks have a somewhat complicated form, the approximation 
part of the network designer’s work is more difficult when using dis- 
tributed RC networks. This fact has led to a continuing effort to 
develop distributed RC networks which realize rational transfer 
functions. Since rational functions are easier to manipulate, and many 
applicable results are readily available in the literature, the approxi- 
mation problem is made much easier. This paper develops a simple 
procedure and network for realizing an accurate approximation to a 
rational transfer function using an active network incorporating a 
distributed RC line. 

Available techniques for synthesizing rational transfer functions 
using distributed RC networks are documented by Heizer, Barker, 
Woo and Hove, and Fu and Fu.?* Each of these techniques uses the 
fact, first demonstrated by Heizer, that some of the immittance 
parameters of a distributed RC line can be made rational functions 
of s by cutting the conducting layer of the RC line in a particular 
manner. 

These synthesis techniques have some definite disadvantages. They 
require two RC lines with cuts in the conducting layer which depend 
upon the transfer function being realized; this is undesirable from a 
manufacturing point of view and makes tuning difficult. Also, the 
synthesis procedure involves a test to determine that the curve cut 
in the conductor satisfies certain restrictions, that is, it does not “at- 
tempt” to create a negative capacitance in the line. If it does, a new 
try at the design is required. Fu and Fu eliminate this problem at 
the expense of a significant increase in circuit complexity.® 

Recently techniques have become available for approximating 
rational transfer functions by using the dominant poles and zeros of 
distributed networks. A few representative approaches are those of 
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Kerwin, Bello and Gausi, and Wyndrum. Kerwin’s approach, in gen- 
eral, requires the use of lumped components.’ Bello and Gausi con- 
sider only low-pass transfer functions and use different configurations 
to realize an arbitrary transfer function.2 Wyndrum’s technique also 
deals only with low-pass transfer functions.° 

The synthesis technique described here offers advantages over the 
other available techniques since only one uniformly distributed RC 
line is used and it is capable of realizing an accurate approximation 
to an arbitrary transfer function. In addition, the design procedure 
is very simple. 


II. TRANSFER FUNCTION OF UNIFORM RC LINE WITH FEEDBACK 


Chen and Levine*® “ have suggested that filters could be built 
using a uniform RC line driven by an input voltage source and hav- 
ing the output formed as a linear combination of the voltages ap- 
pearing along the line as in Fig. 1. This procedure is useful in some 
cases but is not general enough because it synthesizes transfer func- 
tions by using zeros of transmission. What is needed in addition to 
zeros are poles; poles can be realized by using feedback as in Fig. 2. 

The network of Fig. 2 consists of a uniform RC line with taps 
spaced along its length. The tap voltages are appropriately scaled by 
the infinite input impedance coefficients a; and added to the input 
signal to form the driving voltage for the line. The output voltage is 
the sum of the tap voltages appropriately scaled by the infinite input 
impedance coefficients b;. The RC line is the three-layer structure 
shown in Fig. 3, where it will be assumed that there is no voltage 
variation in the y direction. 

To determine the voltage transfer function of the network of Fig. 





Fig. 1— Tapped RC line. 
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Fig. 2— Tapped RC line with feedback. 


2, we first determine the voltage gain G;(s) from the input of the line 
to a point x; meters from the input. The result is* 


cosh (1 — x,)(res)? 
cosh I(res)? : 


where | is the total length of the line in meters, and r and c are the 
resistance and capacitance per meter. If the distances / and x; are 
constrained to be integral multiples* of some fixed length d, , that is, 
Ll = Ld) and x; = id) , and we let r = rcdj , G;(s) becomes 


cosh (L ~ i)(rs)* () 
cosh L(rs)? 

Using (1) the voltage transfer function of the network of Fig. 2 be- 

comes 


G(s) = 


G(s) = 


L 
>> b; cosh (L — 1)(rs)? 

Gs) = K = ; (2) 
‘> ¢; cosh (L — 1)(rs)? 
1=0 

where c = landc; = —a; forz ~ Of . The real constant K is such that 

b; = 1 for the smallest 7 for which b; + 0. By making the substitution 

p = exp (rs)? and factoring the resulting polynomials in p, it can be 


* Tor any set of 2, and J, a small enough d> can be found that error in this 
assumption is negligible. . 
fa has been set to zero which can be done without any loss of generality. 
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LLL 








Fig. 3— Uniform RC line. 
shown that (2) can be factored into the form 


Derk Il (cosh (rs)? — Z,] 
Oe Gee le ee (3) 


I] [cosh (rs)? — P,] 


where 1 S$ RF S L [unless the numerator in (2) is unity in which case the 
numerator of (3) is 2'~”] and the quantities P, and Z, are real or occur in 
complex conjugate pairs. 

Before considering the question of stability, we will determine the 
locations of the poles and zeros of G(s). Notice that, in spite of the fact 
that (s)* is involved, G(s) is single valued. To determine the pole (zero) 
locations, we set the denominator (numerator) factors in (8) equal to 
zero and solve for s. For a typical denominator factor (cosh (rs)? — P;) 
we calculate the s-plane pole positions to be 


rs; = In? |p; | — (arg p; + 2n)? + 2j In| p; | (arg p; + 2nr) (4) 


where n = 0, 1, +2, ... and p; = P; + (P? — 1)'. The term p; 
comes from the solution of a quadratic equation which has two roots. 
However, these roots are always reciprocals of one another and, as can 
be seen from the form of (4), these two values of p; give the same s; . 
Hence, only one of them need be used. A simple check shows that each 
of the poles resulting from the single term (cosh (rs)? — P,) as given by 
(4) is simple.* 

When P, is real, the s; given by (4) are on the negative real axis for 
| P; | S 1 and occur in complex conjugate pairs for | P; | > 1. When P; 
is complex, (3) involves a term [cosh (rs) — P#] which gives poles that 
are the complex conjugates of those of (4). 

It is easy to see from (4) that the infinite set of poles generated by one 


* For P = +1 double roots occur but not for P = +1 withn = 0. 
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denominator factor lie on a parabola given by 


oy In’ | p; | wT 

Ce A py (6) 

Figure 4 shows the location of these poles in the normalized, r = 1: 
s-plane. The poles due to the term [cosh (rs)? — P,] are indicated by 
single circles and those due to the term [cosh (rs)? — P*] by double 
circles. Similar comments hold in the case of numerator factors in (8). 
Knowing the locations of the poles of G(s) permits the question of 
stability to be answered easily. For simplicity we assume that in (2) 
bo = 0. If this is not the case, G(s) can be separated into the sum of a 
constant plus a G(s) which is of the form of (2) where G(s) has the same 
denominator as G(s) but different numerator and b) = 0. The constant 
gain is stable. With b) = 0, G(s) is stable, that is, its impulse response 
remains bounded for large values of time, if all the poles lie in the left 


NORMALIZED S- PLANE jw 


bn? |p| -—2— 
oo = n Pj =, 
* ane [pil lapel pi + 49r)*--—- 







n=te © >| (arg pi t+ 2ar)? 


EN cco dat -2£n2 [pil 
AX eo targ pu? 
o 
©) 
n=-1 
— --(arg pi—e2ar)? 


n=-2 
ol ecianig cae 


—+| Ln? | pil 


Fig. 4— S-plane roots resulting from a pair of complex conjugate factors in 
G(s). 
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half of the s-plane and those on the jw axis are simple. This result can be 
proved by finding the inverse transform of G(s) by using the Cauchy 
residue calculus.” 

Notice that G(s) is a meromorphic function and G(s) > 0 ass— ©. 
The Laplace inversion relation is written as 


atin 

This integral is evaluated by closing the contour in the left half plane 
so that it does not pass through any poles of G(s) and encloses a 
finite number of poles. The value of the closed contour integral is 
determined by the residues of the poles enclosed. As n > © Yn > © 
and the contours in the left half plane become larger without bound. 
Using Jordan’s lemma” the integral over the left half plane contour 
approaches zero and g(t) is determined. For large values of time the 
behavior of g(t) is dominated by that pole with the most positive 
real part. The stability requirement follows directly from this. 


Ill. TRANSFER FUNCTION SYNTHESIS 


A glance at Fig. 4 shows that, if the n = 0 pole is close to the jw axis, 
the response of the network will approximate that of this single pole 
alone for values of w near the pole. An examination of (4) shows that this 
dominance can always be made to occur by an appropriate selection of r. 
From (4) the pole positions in the s-plane are proportional to 77°. 
Therefore, by decreasing 7 the poles become more widely spaced and 
hence those near the jw axis become more dominant. Since p; can be 
adjusted so as to cause the n = 0 pole to be arbitrarily close to the 
s-plane origin, a decrease in 7 can be offset, for the n = 0 pole, by chang- 
ing p; . Therefore, the » = 0 pole can be made dominant. Hence, a 
rational transfer function can be approximated by the system considered 
here by making its dominant poles and zeros match those of the desired 
rational function. To calculate the feedback and feed forward coeff- 
cients of (2) we calculate the P; and Z; of (8) by using the desired pole 
or zero for s; in 


oY = cosh (zs,)' (6) 
and multiply the factors in (8). 


The scale factor 7 controls the dominance of the n = 0 poles and zeros; 
the dominance improves as 7 is reduced. A lower limit on practical 
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values of 7 occurs because the network sensitivity generally deteriorates 
with reduced values of 7. An upper limit on 7 occurs because the n = 0 
poles are restricted to the shaded area of Fig. 5. If 7 is large enough so 
that a desired dominant pole s; lies outside the shaded region, the net- 
work will realize this pole for a nonzero value of n. It is clear from Fig. 4 
that the network will then have an n = 0 pole with a more positive real 
part than that of s; . This pole can destroy the desired dominance or 
cause instability if it lies in the right half s-plane. The region permitted 
for n = O poles in Fig. 5 is determined from (4) by setting n = 0, 
substituting a value for w; and solving for the most negative value of ¢; . 
The resulting restriction is 


2 
foo Teed ea 
O22 a; a (7) 


Except in rather unusual situations + will be much smaller than the 
maximum implied by (7). 

To synthesize an approximation of a given rational transfer func- 
tion, the following simple steps are performed. 


(1) + is selected so that all the poles of the transfer function lie in 
the region shown in Fig. 5, and the resulting n = 0 poles and zeros 
realized by the RC line are dominant. 





Fig. 5— Permitted n = 0 pole positions in s-plane, 
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(1) The desired transfer function poles and zeros are used in (6) 
to determine the P; and Z; which, when substituted in (3) and multi- 
plied out, yield the feedback and feed forward coefficients for the net- 
work. 

(iit) The exact response of the network is calculated using (2) or 
(3) to verify that a good approximation has been achieved. 


As pointed out in the examples in Section VI, a wide range of values 
of + gives a very accurate approximation. Thus, a little experience 
will then make step 77 unnecessary. The selection of + also affects the 
sensitivity of the network; hence sensitivity considerations may de- 
termine the best value of r. 


IV. SENSITIVITY 


One of the most important aspects of any active network synthesis 
technique is its sensitivity to various parameter variations. In addi- 
tion, sensitivity results are necessary to show how a physical network 
may be tuned to achieve an accurate realization of the requirements. 
Of the several different sensitivity functions that could be derived, 
we have chosen to consider the relative changes of the poles and 
zeros with a variety of parameters. These seem to give good physical 
insight into the behavior of the circuit and result in reasonably con- 
cise expressions. The sensitivity functions derived are the relative 
changes of the poles resulting from relative changes in feedback 
coefficients, r, tap positions, and tap loading. Similar results hold for 
the zero sensitivity functions. The details of the derivations are con- 
tained in the Appendix. 

If A; is the pole in question and the sensitivity of that pole to some 
parameter X is defined as 


rs OM X 
* OK K;? 
and Pg = cosh (rA,) where A, is the qth pole, then we have the fol- 
lowing: * 
(1) Pole sensitivity to feedback coefficients: 
a; cosh (L — 1)(rn,)? 


( . 
2*(7),)! sinh (rd,)* TI (P; — Py) 
k=1 


7 





s (8) 


* i; is assumed to be a simple pole. 
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Simple relations for determining the numerator of (8) are given in 
Appendix equations (20) and (21). 
(2) Pole sensitivity to RC product, 7: 


S¥ = 1. (9) 
(112) Pole sensitivity to improper tap spacing* 
. . ae ; 4 
GM = ga SUE SON ee F, (10a) 
2"-? sinh (r),)* [] (P; — P,) 
k=1 


Ea] 


L-1 
L >> ¢ sinh (L — k)(r),)? 
Si ae SS ee (10b) 
2”~? sinh (7r;)? II (P; — P,) 
k=1 
44 


(iv) Pole sensitivity to tap loading: 


i-l 
g:R cosh (L — 4)(7d;)? D> e, sinh (¢ — k)(7d;)4 
SS (11) 
Qed; sinh (rd;)? Il (P; = P,) 
k=] 


ae] 


where g; is the conductance loading the 7th tap and R = dor. 


V. SECOND ORDER DESIGN EQUATIONS 


For the case where LZ = 2, that is, where the RC line is realizing an 
approximation to a second order transfer function H(s), the design 
and sensitivity relations given above take on the very simple forms 
below (A and p, which are complex, are the pole and zero positions in 
the upper left half s plane): 

P = cosh (r))3, Z = cosh (rp)? 

d = 0, a, = 4 Re(P), a = — Q+2{|P]*) 

bo = 6, = 0, b, = 1 for H(s) with no finite zeros 


bo = 1,6; = —4 Re(Z), b, = 14 2| Z|’ for H(s) with finite complex 
Zeros 


II 





* l,do = x1 where x is the distance from the input of the RC line to the 7th tap. 
Nominally ); = 2. 
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bo = 0, b, = 1, b. = — 1 for H(s) with one zero at zero 
S, = —-1 


9 = 2P Re (P) gh = -1 +2|P |) 
ox ~ j(7d)* sinh (7))* Im (P)’ #2 “~ 23(rd)* sinh (7d)? Im (P) 


ee: —2 Re (P) ge _ 2p" 
"jg Im(P) ’ '* “7 Im (P) 
Ss = —g,RP Ss = g.htP* 
%  Qjrh Im (P)’ °2 «jth Im (P) 
" 277 —1 : , 
St. = for H(s) with finite complex zeros. 


~ -2j(zp)* sinh (zp)? Im (Z) 


For the case where H(s) has two complex zeros, the zero sensitivities are 
the same as for the poles with p and Z replacing \ and P, except for S$, 
which is given. For the case where H(s) has a zero at zero and at infinity, 
the sensitivity of the zero at zero is infinite (due to the normalization by 
1/p), but unnormalized, 


Op _ dp _ _ 3 op _ _ 2h 
0b; 7 Obs = T and 092 7 T 


Other sensitivities not given are zero. 


VI. EXAMPLES 


Two examples of approximations to second order rational transfer 
functions will be worked out and compared with experimental results 
achieved with a thin film line. The two functions to be approximated 
are, normalized in frequency, 


_ (8/4) +1 
0.01 s 
0 lee aT ve 


The first is a noncritical low-pass function with a pair of zeros on the jw 
axis and the second is a band-pass function with a Q of 100. 

For the low-pass function (12), sensitivity is not a problem be- 
cause the poles are very low Q. Therefore, 7 can be selected to satisfy 
(7). and to insure dominance of the poles. Letting s = 1 we have the 
following results: 
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1=(-14+)/(2'}, P = 0.646 + 0.313), 
p= +44, Z = 0.342 + 71.9 
a =0, a, = 2.59, a, = —2.032 
b=1, b =—136, bb =8.5, K = 0.0544 
S, = 3.33 2154°, S*, = 3.652 —52° 
* = 4,12 290°, * = 46Z—116° 
S = 1.159¢,RZ—19°, SS, = 2.39.2 2109° 
Si, = 0.508 2125°, S?, = 0.162 —137°, Si, = 0.512 2 —37° 
? = 0.359 290°, * = 2.03 Z—170° 


S?, = 0.1269,2 280°, S?, 


0.254g,R Z 100° 


Figure 6 shows a block diagram of the experimental circuit, the theoreti- 
cal response, and the measured results. Notice that the theoretical 
response realized by the RC line and that of the rational function cannot 
be distinguished on the scale used for this figure, since they differ by 
1 percent at most. 

In the case of G(s) which has a pole with a Q = 100, dominance is 
achieved for a wide range of values of 7 for which (7) holds, and the 
selection of 7 is influenced primarily by sensitivity considerations. The 
parameter 7 affects the sensitivity in a rather complicated way as can 
be seen from the various sensitivity relations. An examination of the pole 
sensitivity to coefficient variations has shown that S*, has a rather broad 
minimum in the range 2 S 7 S 14 and that S%, goes to zero in this range 
when a, = 0. Therefore, without an exhaustive study to determine an 
optimum value of 7, we select that value which gives a, = 0, that is, 
7 = 4.94. With this value of 7 the following result: 


4 = —0.005 + 7 P = 72.34 
Ay = a, = 0 a = —11.95 
bp = 0, b, = 1, b, = —1 K = 0.051 
St, = 0.452 4 —45° Si, = 22180° 
S = 0.10159,R 290° S*, = 0.2039. 290° 
cl eel ae 0.405 2 180° Ls 0.405R 2 180° for the zero at zero. 


db, dbz 892 
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(b) 


Fig. 6 —(a) RC line with feedback approximating Gi(s). (b) Gain vs fre- 
quency for Gi(s). 


The block diagram of the experimental circuit, the theoretical re- 
sponse, and measured results are shown in Fig. 7. The difference between 
the theoretical gain of the RC line and that of the rational function is 
not noticeable since it is approximately 0.1 percent over the frequency 
range shown in the figure. 

The sensitivity of this network is quite acceptable. S*, can be con- 
trolled by the proper selection of impedance levels; 1, , a, and 7 can be 
stabilized so that the values of S*, , S*, and S* are satisfactory. I, should 
not change after manufacture; a, can be made to depend only on the 


2064 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1968 





— RATIONAL FUNCTION 
AND THEORETICAL 
RESPONSE OF DIS- 
TRIBUTED NETWORK 7 


EXPERIMENTAL 
RESULTS 


i] 
fo} 


-20 


GAIN IN DECIBELS 











-40 
0.80 0,85 0.90 0.95 1.00 1.05 1.10 1.45 
NORMALIZED FREQUENCY 
(b) 


Fig. 7— (a) RC line with feedback approximating G2(s). (b) Gain vs fre- 
quency for G(s). 


ratio of two resistors which track with temperature and 7 can be stabi- 
lized by selecting the temperature coefficients of the resistive and 
capacitive materials of the line to be negatives of one another.* 

Several final notes concerning the network are in order. By isolat- 
ing the taps on the line with emitter followers when necessary, it is 
possible to reduce to two the number of operational amplifiers in the 
network used for combining and scaling, one for the feedback voltages 
and another for the feed-forward voltages. When several of these net- 
works are cascaded, one of these two can be eliminated by using an 
operational amplifier from the suecceeeding network. One RC line can 
be constructed with a large number of taps. Then by selecting the ap- 
propriate set of taps, the line can be used for a variety of purposes 
and at different frequencies. 


* Tantalum resistors on a substrate can be made to track within + 5 ppm/°C and 
RC products can be made to track within + 30 ppm/°C. 
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Although only second-order examples were worked out and built, it 
is not unreasonable to expect that advances in building thin film RC 
lines and resistors using tantalum may eventually yield the stability 
of the various parameters required to make higher-order realizations 
possible. 


VII. CONCLUSIONS 


A network has beeen described which uses a single uniform RC 
line with feedback to approximate an arbitrary rational transfer 
function. The design procedure is simple as is the physical network. 
Theoretical calculations indicate that the transfer function realized 
by the RC line is an accurate approximation of the desired rational 
transfer function and measurements made on experimental circuits 
agree well with the theory. 
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APPENDIX 


Derivation of Sensitivity Expressions 


This appendix derives the sensitivity expressions given by (8) 
through (11). The sensitivity of quantity to parameter a is defined as 


n O\a 
S, = da (14) 
If s; is a network pole and D(s) is the denominator of (2), D(s;) = 
0. The equation D(s;) = 0 defines s; as an implicit function of the 
parameters in D(s). By differentiating the equation D(s;) = 0 with 
respect to a parameter a, we can determine the quantity 0s;/da«. This 
result will hold for general values of the various parameters in D(s). 
For the particular case when all the parameters in D(s) have their 
nominal values, s; will in fact be one of the desired network poles, 
that is, s; = A;. Furthermore, the factorization used in going from (2) 
to (8) can then be used to simplify the expression for 0A;/da. The 
sensitivity of A; to a is then determined by using (14). A similar pro- 
cedure using the numerator of (2) gives the sensitivity functions of 
the zeros. 
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A. 1 Sensitivity of Poles to Feedback Coefficients 
From (2) the denominator of G(s) is 


D(s) = ys c, cosh (L — k)(rs)*. (15) 


If s;is a root of D(s;) = 0, we have 
<i bo hs _ yy sinh (L = h)(rs,)* 7 | 
dc, D8) =) = | > oll h) 2(rs;)? dc; 
+ cosh (L — 1)(rs;)}. 
Setting all the parameters to their nominal values gives s; = A; . As 
shown in (18), the term in brackets is nonzero if \ is a simple pole and 


Th; # — n’x” where n is a nonzero integer. Therefore, solving the above 
equation gives 








ay cosh (L — i)(rd,)? (16) 
ac; x sinh (L — DIG as 
do c,(L — k) (rh, 


As was done in (3), (15) can be factored, when all parameters have 
their nominal values and P; = cosh (rA,) “, into 


L L 
(s) = >> e, cosh (L — k)(rs)? = 2”~* JI [cosh (7s)? — P,]. (17) 
k=0 k=1 
Differentiating this equation with respect to s gives 


4. (L — k) sinh (L _ hy(rs)* 
24° 2(rs)} 


= 9tn1 > aaron is I [cosh (7s)? — P,], 


em 


and letting s = A;, we have 





Se, eee sin kari)? _ gt URN! I (P; - 
i . 


(18) is nonzero provided }; is a simple pole and rA; # — n’x where nis 
nonzero integer. 
- Using this and dc,;/da; = — 1,7 # 0, with (16) gives 


a; cosh (L — 7)(r),)? 
LL 
2”~*(rd,)? sinh (rd;)? [] (P; — Py) 
k=l 


FT 


NG 
Si = 





(19) 
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The numerator can be simplified by expressing a; in terms of the P;, 
by using (17) and by factoring cosh(L — 7) (rA;)%. The results are 


Ao = 0 
L 
ay = 2 > P; 
i=l (20) 
L L 
a = —L — 2 vp] op, | 
i=) ae 
2 Ae a L * 
: a ete I, age 
and 
(L-i) 25 
cosh (L — (ray = 2? TT {P, — cos Ea Gee ale 
i#L. (21) 


A. 2 Sensitivity of Poles to Tap Position 


The tap positions are directly proportional to the integers & in (15). 
If & in (15) is replaced by J, which is no longer constrained to be an 
integer and s; is a root of the resulting D(s), we have 


L 
D(s;)) = >> c cosh (1, — 1,)(7s)*? = 0. 
k=0 


As in the previous section, differentiating with respect to J,, solving 
for ds;/dl,, using the nominal values l;, = k so that s; = A;, and using 
(18), we have 
aH ta; sinh (L aoe era (22) 
ie sinh (rd;)? II (P; => P,) 
k=1 
i 


and 


L 
L >S¢ sinh (L — k)(rd,)* 


so = — = —________. (22) 
2”? sinh (rd,)? [] (P; — Pd 
k=1 
44 





* Divide a by 2 ifi = L. 
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Fig. 8— RC line model for loading analysis. 


A.8 Pole Sensitivity to Tap Loading 


To calculate the sensitivity of a pole \; to the loading at the zth tap 
by a conductance g; , we assume that all other taps are not loaded, that 
is, g; = O for j ¥ 7, and calculate the voltage transfer function from the 
input to the various taps. Having found these, we calculate the denomi- 
nator of the system transfer function, D(s), and proceed to calculate 
0d,/0g; in the same way as was done in Section A.2. 

To calculate the voltage transfer function we will use the chain matrix 
description of the line which is 


Bees cosh k(rs)?  Z, sinh oe 
.= 
Zs! sinh k(rs)? cosh k(rs)? 


where Zp) = (r/cs)* and kdo is the length of the line. The line, loaded 
by g; at the ith tap, can be considered as the cascade connection of 
an RC line of length zd) connected to a two-port consisting solely of 
g; which in turn is connected to an RC line of length (L — 1)dp as seen 
in Fig. 8. The chain matrix of 9g; 1s 


A,, a ‘|. 
g: 1 


From Fig. 8 and the properties of the chain matrix we have for 0 


° . . 


and fori Sk Ss L 


* = Aral = 4 a) 
te —t, —ty, 


With 2; = 0 the above relations give the voltage transfer functions 
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from the line input to the kth tap as 
G,(s) = ex(s) /eo(s) _ Bas/Bou 


where B,,; is the (1, 1) element of the matrix B, . 
The matrices B, forO Sk <tandi< k <= L, are 


fo (L — k)(rs)* | } 
B. = + g;Zo sinh (i — k)(rs)* cosh (L — 7)(rs)* 
i es bh hoi 


and 
ae ina (L — k)(r8)? ss 
be} fe 


respectively, which give the gains G(s) as 


Gus) = CH (L = W(r9)* + geo sinh i — b)(r8)* cosh (L = i)(8)* 
* cosh L(rs)? + g;Zy sinh i(rs)* cosh (L — 7)(rs)* ; 


forO Sk <i,andfori7Sk<L 


— 





G@ = cosh (L — k)(rs)? 
cosh L(7s)' + g;Z sinh 2(7s)! cosh (L — 1)(zs)* 
By using these equations in the expression for the gain of the feed- 
back structure and multiplying the numerator and denominator of 
this expression by Bo11, we have the following expression for the 
denominator, D(s). 


L 
D(s) oy C.Birt 
k=0 


L 
= >> ¢, cosh (L — k)(rs)? 
k=0 


dG Fic Na’ Sesh GO: 
k=0 


Now proceeding as in the previous sections, let D(s;) = 0 and differ- 
entiate with respect to g; to get 


Os; = c,(L = k)r 


co 24 9078)! sinh (L — k)(rs;)? 


a: aay euch Lo DGahF Osh = GaN ois, 
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With 9; = 0, s; = A; and the above gives with (18) 


i-1 
g:R cosh (L — i)(rd;)* >> & sinh (¢ — &)(r),)? 
= a. 





L 
amar oe sinh (rh,)? Il (P; oe P;) 
k=1 


i 


A. 4 Sensitivity to RC Product Changes 


It is assumed that the product RC = 7 of the line is uniform but not 


correct. The sensitivity of the poles to changes in 7 is easily seen to be 


Sv=-1 


since + always appears multiplying s in the transfer function and is 
a frequency scale factor. 
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